[PDF] High-level Approaches to Detect Malicious Political Activity on Twitter

Abstract

Our work represents another step into the detection and prevention of these ever-more present political manipulation efforts. We, therefore, start by focusing on understanding what the state-of-the-art approaches lack -- since the problem remains, this is a fair assumption. We find concerning issues within the current literature and follow a diverging path. Notably, by placing emphasis on using data features that are less susceptible to malicious manipulation and also on looking for high-level approaches that avoid a granularity level that is biased towards easy-to-spot and low impact cases. We designed and implemented a framework -- Twitter Watch -- that performs structured Twitter data collection, applying it to the Portuguese Twittersphere. We investigate a data snapshot taken on May 2020, with around 5 million accounts and over 120 million tweets (this value has since increased to over 175 million). The analyzed time period stretches from August 2019 to May 2020, with a focus on the Portuguese elections of October 6th, 2019. However, the Covid-19 pandemic showed itself in our data, and we also delve into how it affected typical Twitter behavior. We performed three main approaches: content-oriented, metadata-oriented, and network interaction-oriented. We learn that Twitter's suspension patterns are not adequate to the type of political trolling found in the Portuguese Twittersphere -- identified by this work and by an independent peer - nor to fake news posting accounts. We also surmised that the different types of malicious accounts we independently gathered are very similar both in terms of content and interaction, through two distinct analysis, and are simultaneously very distinct from regular accounts.

Full PDF

F ACULDADE DE E NGENHARIA DA U NIVERSIDADE DO P ORTO

High-level Approaches to Detect

Malicious Political Activity on Twitter

Miguel Sozinho Ramalho M ASTERS T HESIS

Mestrado Integrado em Engenharia Informática e Computação

Supervisor: André Restivo, University of Porto, PhD

Co-Supervisor: Ashton Anderson, University of Toronto, PhD Co-Supervisor: Hugo Sereno, University of Porto, PhD

July 21, 2020

High-level Approaches to Detect Malicious Political

Activity on Twitter

Miguel Sozinho Ramalho

Mestrado Integrado em Engenharia Informática e Computação

July 21, 2020 i

Abstract

Democracy, as a manifestation of the primary desires of nations, rests upon the premise that political elections allow for the citizens of these nations to express their views based on facts and truth. Naturally, it is impossible for every person’s view to be clear and realistic. However, in recent years, the tools available to lengthen that gap have grown in reach and impact, far beyond any acceptable threshold. Online Social Networks (OSNs) are one such example. Indeed, these have become political weapons – Facebook and Twitter being among the most infamous cases. This problem was first prompted during 2016, in the context of the United States presidential elections, where a foreign nation was showed to have interfered – Russia. This example is not an exception. It was the inception of a problem that has come to stay. Our work represents another step into the detection and prevention of these ever-more present political manipulation efforts. We, therefore, start by focusing on understanding what the state-ofthe-art approaches lack – since the problem remains, this is a fair assumption. We find concerning issues within the current literature and follow a diverging path. Notably, by placing emphasis on using data features that are less susceptible to malicious manipulation and also on looking for high-level approaches that avoid a granularity level that is biased towards easy-to-spot and low impact cases. We designed and implemented a framework – Twitter Watch – that performs structured Twitter data collection, applying it to the Portuguese Twittersphere. We investigate a data snapshot taken on May 2020, with around 5 million accounts and over 120 million tweets (this value has since increased to over 175 million). The analyzed time period stretches from August 2019 to May 2020, with a focus on the Portuguese elections of October 6th, 2019. However, the Covid-19 pandemic showed itself in our data, and we also delve into how it affected typical Twitter behavior. We performed three main approaches: content-oriented, metadata-oriented, and network interaction-oriented. We learn t hat Twitter’s suspension patterns are not adequate to the type of political trolling found in the Portuguese Twittersphere – identified by this work and by an independent peer - nor to fake news posting accounts. We also surmised that the different types of malicious accounts we independently gathered are very similar both in terms of content and interaction, through two distinct analysis, and are simultaneously very distinct from regular accounts. Keywords : Twitter, Online Social Networks, Twitter API Data Collection, Political Trolling Detection ii

Resumo

A Democracia, enquanto manifestação dos desejos primitivos de nações, assenta sob a premissa de que as eleições políticas permitem aos seus cidadãos expressar a sua visão com base em factos e verdade. Naturalmente, é impossível que todas as pessoas tenham uma visão nítida e realista, mas os últimos anos foram marcados pelo aparecimento de ferramentas que aumentam esta discrepância, cujo alcance e impacto cresceu bem para além de níveis aceitáveis. As redes sociais online são um desses casos. De facto, estas tornaram-se armas políticas – sendo o Facebook e o Twitter os casos mais gritantes. Esta questão remonta a 2016, durante as eleições presidenciais norte-americanas, onde foi demonstrada a interferência de uma nação externa – Rússia. Este exemplo não é uma exceção, antes foi a génese de um problema que veio para ficar. O nosso trabalho é um mais um passo no caminho da deteção e prevenção destes esforços de manipulação política cada vez mais presentes. Portanto, focamo-nos em perceber quais as limitações atuais do estado da arte – se o problema prevalece, é correto assumir que estas existem. De facto, encontrámos problemas preocupantes na literatura e, portanto, decidimos seguir um caminho divergente. Particularmente, ao colocar a ênfase no uso de atributos dos nossos dados que sejam menos suscetíveis à manipulação maliciosa e também ao investir em abordagens de alto nível que procuram evitar o nível de granularidade que é tipicamente enviesado para casos óbvios de deteção e de baixo impacto. Concebemos e implementámos uma arquitetura – Twitter Watch – capaz de recolher dados do Twitter de forma estruturada, e aplicámo-la à Twittersphere portuguesa. Investigámos uma amostra capturada em maio de 2020, com cerca de 5 milhões de contas e mais de 120 milhões de tweets (atualmente este valor já ultrapassa os 175 milhões). O período analisado vai de agosto de 2019 a maio de 2020, com um especial foco nas eleições legislativas de 6 de outubro, de 2019. Contudo, a pandemia originada pelo Covid-19 manifestou-se nos dados recolhidos e também investigámos o seu impacto no comportamento anterior, na rede social. Levámos a cabo três abordagens principais em termos de análise: orientada a conteúdo, orientada a meta-dados, e orientada a uma rede de interações. Aprendemos que os padrões de suspensão do Twitter não são adequados ao tipo de comportamento de troll com que nos deparámos – neste trabalho e vindo de uma colaboração com um par académico – nem com contas que partilham fake news . De facto, também compreendemos que os diferentes tipos de contas maliciosas que recolhemos, de forma independente, são bastante semelhantes tanto em termos de conteúdo como interação, sendo estas análises independentes, e simultaneamente muito distintas de contas regulares. Keywords : Twitter, Online Social Networks, Twitter API Data Collection, Political Trolling

Detection iv iii

Acknowledgements

Two types of acknowledgments are due here. The first, explicit, goes to my three supervisors. Professor Ashton, thank you for believing in this work from the start, when we were one Atlantic ocean apart, and for still believing it when we were one kilometer plus one pandemic apart. Professors André and Hugo, thank you for showing me that people can stay strong against a system that is oriented towards quantity and not quality, for giving me the freedom to forge my own path, oh, and for those late-night meetings too. To all three, a big thanks for your support, tips, and feedback throughout this project. The second, implicit, is to all the people who were there for me in these past years, and to some few, way before that. Your contributions made me who I am today. Names are not written as I did not want to risk leaving anyone out. I thank those who showed me the patience, wisdom, love, and support I know I did not always deserve. Um sargento nunca abandona os seus soldados, Obrigado a todos, Miguel vi “We all woke up, tryna tune to the daily news

Looking for confirmation, hoping election wasn’t true

All of us worried, all of us buried, and the feeling’s deep

None of us married to his proposal, make us feel cheap

Still and sad, distraught and mad, tell the neighbor about it

Bet they agree, parade the streets with your voice proudly

Time passing, things change

Reverting back to our daily programs

Stuck in our ways, lust”

Kendrick L. Duckworth in Lust viii v

Contents

Glossary xv Abbreviations xvi Introduction 1

Context . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2

Problem Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.3

Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.4

Goals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.5

Hypothesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.6

Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 Background 5

Online Social Networks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.2

Twitter – Rules of the Game . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.2.1

History . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.2.2

Application Programming Interface . . . . . . . . . . . . . . . . . . . . 9 2.3

Manipulation Strategies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.3.1

Misinformation and Disinformation . . . . . . . . . . . . . . . . . . . . 10 2.3.2

Fake News . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.3

Trolls vs Bots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.4

Manipulation Nomenclature . . . . . . . . . . . . . . . . . . . . . . . . 11 2.4

Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 Systematic Literature Review (SLR) 15

Systematic Literature Review Protocol . . . . . . . . . . . . . . . . . . . . . . . 15 3.1.1

Survey Research Questions (SRQs) . . . . . . . . . . . . . . . . . . . . 16 3.1.2

Search Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 3.1.3

Search Query . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16 3.2

Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 3.3

Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18 3.3.1

SRQ1 – What data types and metrics are currently extracted? . . . . . . . 18 3.3.2

SRQ2 – What techniques are being used to process the different data types? 19 3.3.3

SRQ3 – To what end are the analyses conducted? . . . . . . . . . . . . . 21 3.3.4

Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 x

CONTENTS

4 Problem Statement

5 Twitter Watch

6 Exploration

7 Conclusions Epilogue vii References Appendices

A Example of a Twitter API User Object

B Fake News Websites List from Manual Investigation

CONTENTS xi C Uncovered Fake News Websites from Inspection of Known Fake News Posters

D Portuguese Newspaper Covers from October 19th 2019

E Portuguese Newspaper Covers from September 16th 2019

F Evidence of Malicious Activity in Manually Uncovered Trolls (LDA)

G LDA clustering – All Account Types Distributed per Cluster

H List of Capital to Country Analogy Pairs

I Mentions Embeddings Clustering – All Account Types Distributed per Cluster ix xii

CONTENTS

List of Figures

Welsh’s test with significantly different days of the week . . . . . . . . . . . . . 51 6.5 Boxplot and average daily tweets by day of the week, for Covid-19 . . . . . . . . 52 6.6 Daily suspensions detection rate from April 18 th st . . . . . . . . . . . 67

68 6.19 Punch card of troll accounts found with LDA . . . . . . . . . . . . . . . . . . . 69 6.20 Punch card of clickbait accounts found with LDA . . . . . . . . . . . . . . . . . 69 6.21 K-means++ inertia vs. number of clusters – LDA . . . . . . . . . . . . . . . . . 71 6.22 LDA cluster distribution for 3 types of malicious accounts . . . . . . . . . . . . 71 6.23 Heatmap with average topic attention – cluster 39 . . . . . . . . . . . . . . . . . 73 6.24 Heatmap with average topic attention – cluster 35 . . . . . . . . . . . . . . . . . 73 xiv LIST OF FIGURES vs. number of clusters – mentions . . . . . . . . . . . . . . . 84 xi 6.29 Embeddings cluster distribution for 3 types of malicious accounts . . . . . . . . 84 D.1 October 19 th th th List of Tables

LIST OF TABLES xiii

List of Listings Simplified example of the JSON configuration file . . . . . . . . . . . . . . . . . 37 2

Example tweet by @DavidMagarreiro . . . . . . . . . . . . . . . . . . . . . . . 56 3

Example of a Twitter API user object . . . . . . . . . . . . . . . . . . . . . . . . 105 xviii

LIST OF LISTINGS xv Glossary

Docker is a platform as a service (PaaS) product that uses OS-level virtualization to deliver software in packages called containers 30, 32, 42, 43

Docker Compose is a tool for defining and running multi-container Docker applications 32, 43

Gensim is an open-source library for unsupervised topic modeling and natural language processing, using modern statistical machine learning 60, 75

GitHub Pages is a static web hosting service offered by GitHub since 2008 to GitHub users 33 hydrate

In the context of APIs, hydrating means getting the complete object from an incomplete version of that object by means of an API call 39

Jupyter is an open-source project providing interactive computing notebooks and other services xiii, 29, 35, 38

MongoDB is a general purpose, NoSQL document-based, distributed database built for modern application developers and for the cloud era 30, 32, 34, 40, 41, 45

Neo4J is a native graph database, built from the ground up to leverage not only data but also data relationships 30

Pushbullet is a service that provides, among other services, a push notification API, along with web and mobile clients to receive them 34, 38

Sklearn ( a.k.a. Scikit-learn) is a free software machine learning library for the Python programming language 60, 61

State of Emergency is a government self-declared situation that empowers the government to implement policies that it would otherwise not be able to instate 48, 49, 53

TensorFlow is an end-to-end open-source platform for Machine Learning 82

Twittersphere is the collective group of Twitter users and their interactions 2, 10, 27, 31, 40, 75, 86, 87, 90 –

93 xx

Glossary

Abbreviations

API

Application Programming Interface 1, 3, 9, 10, 14, 18, 27 –

30, 33, 34, 38 –

42, 45

CBOW

Continuous Bag of Words 76

CPU

Central Processing Unit 42

DOI

Digital Object Identifier 16

GAN

Generative Adversarial Network 25

I.P.

Informal Publications 16 IP Internet Protocol 33

JSON

JavaScript Object Notation 29, 36, 38

LDA

Latent Dirichlet Allocation 20, 25, 58 –

60, 62, 66, 74, 89, 90, 92 ML Machine Learning 19 –

21, 25, 59, 89

OSN

Online Social Network i, xiii, xv, 1, 2, 5 –

8, 10 –

12, 14, 22, 24, 54, 89, 95

RAM

Random Access Memory 42

REST

Representational State Transfer 9, 28

SCS

Social Cyber Security 2, 11

SLR

Systematic Literature Review 3, 15, 21, 25, 89, 91 SN Social Network 2, 5, 6, 10, 89, 95

SNA

Social Network Analysis 21

SRQ

Survey Research Question 15, 16, 18

SSD

Solid State Drive 42

SSH

Secure Shell 42

SVM

Support Vector Machine 20 TJ Trade Journals 16 UI User Interface xiii, 9, 31 – UMAP

Uniform Manifold Approximation and Projection 82

UML

Unified Modeling Language 9

URL

Uniform Resource Locator 54, 55, 60

Chapter 1

Introduction “I’ve never been a good estimator of how long things are going to take.”

Donald Knuth

Twitter is one of the most popular Online Social Networks (OSNs) in existence, having more than

149 million daily active users worldwide, as of Q3’ 2019 . Due to its widespread adoption and the social backgrounds it encompasses, it has also become a prime platform for political discussion. Simultaneously, it exists alongside a fully-featured Application Programming Interface (API) that can be used, among other things, for automating account interactions with the OSN. This API allows for a multitude of useful real-time tasks, such as public risk monitoring [42], advertisement [31], customer service [40], public opinion tracking [68], and stock market prediction [68], to name a few. Twitter and its massive adoption can be traced back to several significant moments in recent history. First, we can trace it back to the creation of the internet by Tim Berners-Lee in 1989, which was later reported in his infamous 1992 paper [9]. By 1997, the first version of OSNs appeared [47] 1 https:/twitter.com https://investor.twitterinc.com https://developer.twitter.com/en/docs Introduction – as first attempts at using the power of this connected new world. SixDegrees and AsianAve are two primordial attempts at that. Since then, many new Social Networks (SNs) have surfaced, each with its specific rules and environment, their adoption rates growing with the democratization of the personal computer. More so, in the past decades, due to the extremely high penetration smartphones have as a social commodity . The combination of this ease of access and connectivity led to a level of adoption and use of OSNs that made them a constant presence in many people’s lives. However, Twitter and other OSNs’ benefits come with risks. One such risk is that the automation power it provides can be used with ill intent. Increasing the relevance of topics [49], influencing opinions [6], or even suppress the rise of insurgent movements [67] are only a few examples. On top of this, nothing prevents users from creating fake accounts, regardless of having good or bad intentions. Twitter is, therefore, an undeniably complex digital ecosystem that stands vulnerable to both small scale independent political influence as well as to orchestrated attempts at wide-scale political manipulation – some go as far as coining these phenomena as Social Cyber Security (SCS) [10] [28]. With all the above considerations in mind, the present work focuses on shining a new light on the process of detecting, analyzing, and measuring the impact of malicious political interference on Twitter. First, there are still a lot of unsolved issues in current research, as we will see in the following chapters. Second, societies all over the world should have tools that allow them to better regulate and monitor political events so that ill-intended actors will not be able to keep on taking advantage of the vulnerabilities in OSNs. Therefore, there is a clear need to evolve the research lines of online opinion manipulation further. This need should be mainly focused on practical application. Due to this, aiming to build a context-free strategy to tackle this problem is detrimental. Most approaches observed nowadays are both limited to context and focus on a granularity level that reinforces their limitations, as will be further explained in Section 3.3 (p. 18).

This work will focus on understanding what is the current stance in regards to detecting political manipulation campaigns on Twitter, identifying limitations in the state-of-the-art approaches, pursuing http://sixdegrees.com/ https://en.wikipedia.org/wiki/AsianAve https://en.wikipedia.org/wiki/List_of_countries_by_smartphone_penetration a line of research that is oriented towards overcoming these limitations, and applying it to the Portuguese Twittersphere. This work aims at testing the following hypothesis:

Do different high-level approaches to Twitter data analysis, when applied to distinct types of malicious activity, a) lead to meaningful results and b) agree among themselves?

In this case, high-level will refer to approaches that avoid the account-level analysis, like building classifiers based on the features of one account, and instead focus on looking at substantially larger amounts of data pertaining to a larger set of accounts. In this sense, we look at data with an intent of uncovering patterns that are related to the behavior of the overall pool of accounts analyzed, or specific to custom account types within that pool. Although the full explanation for choosing the hypothesis is a consequence of the work presented in Chapter 3 (p. 15) and Chapter 4 (p. 23), it is advanced in this section as a means for the reader to have it in mind throughout the remaining parts of this work. For more details on the hypothesis and its validation, please refer to Section 4.3.1 (p. 25).

This document is divided into six other chapters. Chapter 2 (p. 5) will provide sufficient knowledge of how Twitter works and what some of the fundamental definitions used throughout this thesis. Chapter 3 (p. 15) contains a protocol description followed by the actual execution of a Systematic Literature Review (SLR) oriented towards mapping current research efforts in the fight against social manipulation for political purposes on Twitter. This chapter also works as a catalyst for understanding the limitations of current research. Chapter 4 (p. 23) builds upon this understanding and provides a collection of current approaches that are detrimental for solving the limitations identified in Chapter 3 (p. 15). It also includes a more detailed definition of the problem and the hypothesis, and how it will be validated. Chapter 5 (p. 27) describes a framework for massive structured data collection using the Twitter API. This framework is then used to generate the data analyzed in Chapter 6 (p. 47). Finally, Chapter 7 (p. 89) focuses on summarizing the rest of the work and providing a synthesis of the main ideas presented and conclusions drawn, bringing a full-circle view on how we can look and combine high-level approaches as a useful tool for political manipulation detection. 4

Introduction

Chapter 2

Background “A reliable way to make people believe in falsehoods is frequent repetition, because familiarity is not easily distinguished from truth. Authoritarian institutions and marketers have always known this fact.”

Daniel Kahneman, in Thinking, Fast and Slow

This chapter describes the main concepts required to understand the rest of the work by having a common ground of terminology. It first goes through a more in-depth description of Online Social

Networks (OSNs) and Twitter’s inner workings. It then delves into what strategies and goals are typically associated with OSNs manipulation, and what main categories exist to describe it.

A Social Network (SN) is a collection of people connected by their social relationships and interactions – these have been present ever since humankind’s earliest times. With the advent of the World Wide Web [9], however, the underlying idea to SNs soon started being mapped to the online world.

OSNs are defined as “online communities among people with common interests, activities, backgrounds, or friendships” [ and others [31]. Since then, the 5 http://sixdegrees.com/ Background democratization of technology potentiated by the appearance of smart-devices like smartphones led to mass adoption of these virtual environments. OSNs like Facebook , Twitter, Reddit , Instagram , LinkedIn and others have since came up. Figure 2.1 was adapted from [31] and shows a number of OSNs created between 1997 and 2001. These platforms give users a myriad of possibilities like establishing social presence [13], managing contacts [31] or engaging with news [40]. Their continuing success is undeniably linked with their addiction-oriented design [21 ] and it is not expected that they should lose their users’ attention, time and investment any time soon [21]. Figure 2.1: OSNs creation timeline OSNs give people an online persona and allow for interactions with content or people, typically through shared interests. Because of this, they can be represented as graphs. The exact representation varies between different SNs. In Twitter, users can follow other users, unidirectionally. Whereas Facebook friendships have to be bidirectional so that an actual link exists between accounts. Although this is the most typical way of representing an OSN – with users being nodes and their direct connections being edges – it is also possible to build other types of graphs. One example is a graph where Twitter hashtags are the nodes, and an edge exists between two hashtags if both co-occur in a tweet, this is illustrated in Figure 2.2 (p. 7) taken from [62]. http://instagram.com/ Background

Table 2.1: Overview of main OSNs by size and main characteristics LinkedIn Facebook Reddit Twitter Instagram Creation year 2002 2004 2005 2006 2010 Montlhy active users 260M 2500M 430M 330M 500M Purpose S,L C,S,SO S,SO,D S,SO,D C,S,SO Relationship profile B B,S S B,S S Indentity type both both individual both both User connections U U/D D D D

Purpose labels : S=sharing, L=linking, C=Communication, SO=Socializing, D=Discussion

Relationship labels : B=Business, S=Social

User Connection labels : U=undirected, D=Directed

Twitter will be the OSN studied in this work. As such, more attention is dedicated to understanding its inner workings, how its data can be accessed and how behavior can be automated on it.

Twitter is a microblogging service [40] released on July 15 th , 2006. Its initial format allowed users to create an account and post 140 character-long messages [40]. These messages are called tweets . More recently, in 2017, this limit changed. Nowadays, tweets can have up to 280 characters. Officially, this is meant to allow people to express themselves more easily . Besides writing tweets – tweeting – users can also follow other accounts, retweet , reply to tweets, like tweets, use mentions and hashtags , and search. As users start following other accounts, their timeline represents what those accounts do on the OSN. A retweet is a way of replicating what someone else has tweeted on their timeline. If the retweet includes a comment it is called a quote tweet or retweet with comment . Those who follow a user can see these retweets in their timelines. A Retweet is conceptually a tweet that links back to another tweet. Replies are similar, but their goal is to engage in discussion in the context of a tweet, leading to a discussion thread for a given tweet. Likes represent support for a tweet and will also appear on the user’s timeline. Every tweet text can include mentions and hashtags. Mentions are of the form @twitter and will reference an existing account from inside the tweet. Similarly, hashtags appear as – Twitter uses them to calculate trending topics and also to help people search for relevant content [40]. For a more detailed explanation of Twitter https://blog.twitter.com/en_us/topics/product/2017/tweetingmadeeasier.html Background real-world usage, please refer to The Twitter Book [53]. Figure 2.4 (p. 9) shows the relationships among the different entities and

Much as a consequence of the automation freedom the Twitter API provides, wicked applications quickly rose in the Twittersphere. Although this automation is under strict regulation , the problem remains. This section contains more information on the types of non-allowed behavior that is typically present on Twitter, what concepts surround it, along with useful nomenclature. Shannon’s Information Theory [

59] represents the earliest significant attempt at a mathematical definition of information. Despite its relevance in terms of data communication, this definition is too strict. In the context of OSNs, information needs to be interpreted in its well-known variant, or as described in [64 ], a “communicative construct wh ich is produced in a social context”. Indeed, in terms of SNs, it is hard to escape the concepts of information, misinformation, and disinformation. In [36], misinformation is defined as a type of information since it has no inherent constraint on its veracity. Misinformation can be defined as simply as incomplete information [43] or as thoroughly as inaccurate, uncertain, vague, or ambiguous [36] [37]. When misinformation is produced with the intent of spreading a falsehood, it is treated as disinformation. In [22], it is argued that disinformation is not a subset of misinformation since a piece of information that is true but is still misleading is indeed disinformation and might not qualify as misinformation – disinformation needs only be misleading, according to those authors. In this work, however, we separate the concept of disinformation and misinformation by its deceptive intent or lack thereof, respectively. Table 2.2, adapted from [36], contains a comparison of these three concepts. The label Y/N should be interpreted as context and time-dependent. Table 2.2: Comparing Information, Misinformation and Disinformation [36] Y=Yes, N=No, Y/N=Yes and No, depending on context and time https://help.twitter.com/en/rules-and-policies/twitter-automation True

Informative 2

Background

Nowadays, the concept of misinformation and disinformation is closely related to that of fake news . However, a clarification of what is understood by “news” is in order.

For clarity reasons, we shall adopt the view of news as a byproduct of journalism [63] that is, in its common format, an account of recent, exciting, and meaningful events [38]. Consequentially, fake news is the intersection between news and either misinformation or disinformation. In a recent survey [66] of fake news typology, the authors identified the following types of fake news: news satire, news parody, fabrication, manipulation, advertising, and propaganda. Additionally, more pragmatic definitions focus on verifiably false [2] viral posts made on social media that imitate legitimate news reports [63].

A clarification is required given the lack of agreement in the literature as to what is a troll , what is a bot , and what lies between them. Some authors define trolls as human-managed accounts that have a negative online behavior [30] whose goal is to attract as many responses as possible [20]. Simultaneously, some authors define bots as automated agents with similar goals [26]. Others refer to the term social bots [67] as an indistinct version of either trolls or bots. Furthermore, some of these accounts that post malicious content do so in a hybrid manner by combining automated and possibly centralized command with human-produced content and insights [10] [39]. These techniques aim at staying under the radar since they dilute the most apparent bot activity giveaways. Since this nomenclature is not universal, we feel we need to clarify how we use it. In this paper, we use the term bot to refer to an automated account; troll to refer to potentially manually managed accounts, even though in practice it is possible to be fooled by automated accounts that resemble human behavior, so this is the one term that can fluctuate in the automation spectrum; malicious account as referring to either trolls or bots; automated account, automated agent, social bot, to refer to bots. So, if we observe a high level of automation, we consider that account to be a bot, otherwise we will use the term troll. Additionally, not all bots are malicious, but all trolls are malicious. With this compromise, we can focus on the effects of malicious behavior rather than on specifying what originated it. Besides, it also lifts the dangerous assumption [28] that these accounts are not managed by a human today and a script tomorrow.

Trolling has accurately been described as a Social Cyber Security (SCS) problem [10]. Following that perspective, the authors in [27] have compared trolling with hacking. There is a target – OSNs. There are vulnerabilities – the operating principles of OSNs. There are exploits – that take advantage of these vulnerabilities. Simultaneously, there is also a distinction between black hat, gray hat, and white hat trolls – as is the case with computer hackers [27]. Black hat trolls are driven by personal gain and can elude ethical issues. Grey hat trolls have the opposite behavior, typically to push altruistic or anti black hat troll agendas. White hat focus on identifying both previous types and report them to appropriate entities so that they can be removed. In this work, our primary focus is that of identifying black hat (malicious) trolls. However, gray hat trolling activity may have comparable properties and can thus also be labeled as trolling by detection techniques. Another convention defined for this work is that of using the terms manipulation campaign, opinion manipulation campaign, manipulation efforts, malicious campaign, bot campaign, and social bot campaign interchangeably. Although it might not be fully coherent with the black/gray hat distinction in the previous paragraph, it will simplify common terminology without impacting the experimental work. In addition to the previous section, but with less focus on defining recurrent terms and more on providing an overview of the trolling phenomenon, we present Table 2.3 (p. 13) taken from [27]. This table comprehensively consolidates the multitude of techniques and approaches that trolls can use and is a good reference point to take into account when classifying opinion manipulation campaigns in OSNs. 4 Background

13 Table 2.3: List of Trolling Techniques taken from [27]

Technique Name

Technique Description

Source Hacking, “Journobaiting”

Planting false claims or posing as sources to dupe mainstream media, especially in the wake of a crisis event.

Keyword Squatting

Associating a rarely used keyword or search term, especially one that becomes suddenly popular in the wake of a crisis event, with disinformation or propaganda.

Denial of Service

Overwhelming a public space with information or data designed to promote exhaustion and disaffection, or generally frustrate sensemaking efforts.

Sockpuppetry

The creation and management of social media accounts designed to appear as authentic individuals , typically using “data craft” [

Persona Management,

Botherding

The co-ordination of multiple sockpuppet accounts or their algorithmic automation as a botnet

Ironic Hedging / Bait and

Switch

Using hateful or extreme language and imagery in a way that creates plausible deniability about intent and empowers messagers and some interpreters to downplay sincerity and seriousness.

Political Jujitsu

Soliciting attack from an opponent to elicit sympathy from political allies, ground victimization narratives, facilitate recruitment, and justify counterattack.

Controlling the Opposition

Using sockpuppet accounts to pose as representatives of an oppositional group

Astroturfing

Using sockpuppet accounts to create the illusion of a bottom-up social movement or social outcry.

Wedge Driving

Inserting narratives designed to create divisive infighting among social groups. Often part of an overarching “divide and conquer” strategy.

Memetic Trojan Horses

The popularization of seemingly banal content that opens the Overton Window [8] by prompting commentary from mainstream journalists

Deep Fakes

Altering photographs and videos to change the original message, in a way that is difficult to detect.

Concern Trolling

Disingenuously expressing concern about an issue in order to derail discussion and damper consensus. Posing as a concerned ally or objective third party in order to make criticisms more palatable.

Brigading / Dogpiling

Bombarding a targeted individual or space with ad hominem attacks from multiple accounts

Conspiracy Seeding

Spreading “breadcrumbs” on social media and anonymous forums to nudge participants towards conspiracist thinking

Algorithmic Gaming

Exploiting the functioning of an algorithm or related databases to elicit a result not intended by its designers.

In this chapter, we have established common ground in terms of background knowledge required to understand the following chapters better. This knowledge includes an understanding of what OSNs are, what are the main OSNs in existence, how Twitter and its API work, known trolling practices and relevant trolling nomenclature. 6

Background

Chapter 3

Systematic Literature Review (SLR) “Sometimes you will hear leaders say, ’I’m the only person who can hold this nation together.’ If that’s true then that leader has truly failed to build their nation.”

Tim Marshall in Prisioners of Geography

The present chapter describes the protocol and results of a Systematic Literature Review (SLR) conducted with the intent of gaining in-depth knowledge of the types of approaches used nowadays to detect malicious content and its spread within Twitter.

This section contains a formal definition of the process used during the SLR. It intends to ensure both the soundness of the procedure as well as its replicability. SpLuRT , a TypeScript command-line tool, was chosen to organize the search and filtering stages. First, we raised three Survey Research Questions (SRQs). Secondly, we constructed a search query aimed at finding academic papers that can answer these questions along with inclusion and exclusion criteria. Then, on November 23 rd , 2019, we performed the search on the Scopus and DBLP databases, resulting in a total of 2,787 papers. This initial set was filtered as described in Figure 3.2 (p. 17), and the final paper count obtained was 26. Each of these papers was analyzed https://github.com/arestivo/splurt https://dblp.org/

15 8

Systematic Literature Review (SLR) and summarized, although we only include aggregated considerations of their content, namely in Section 3.3 (p. 18). Finally, we propose answers to the SRQs and included detrimental takeaways that will guide the rest of the work in this thesis.

In the context of uncovering, classifying, and measuring the impact of malicious political disinformation and misinformation efforts on Twitter:

SRQ1

What data types and metrics are currently extracted?

SRQ2

What techniques are being used to process the different data types?

SRQ3

To what end are the analyses conducted?

The search was guided by the following exclusion and inclusion criteria.

Exclusion Criteria

Exclude papers that are: • not written in English; • missing Digital Object Identifier (DOI); • marked as Informal Publications (I.P.);• marked as Trade Journals (TJ); • surveys. Inclusion criteria

Include papers that: • are at least as recent as 2016; • have a title related to the SRQs; • have an abstract related to the SRQs;• are cited according to the following rules: ◦ ≥

10 if from 2016; ◦ ≥ ◦ ≥

3 if from 2018; ◦ ≥ Below is the search query constructed to look for papers that helped answer the SRQs: twitter AND (politics OR political OR election) AND (bot OR troll OR ֒ → agents OR actors OR ``fake news'' OR misinformation OR ֒ → disinformation OR ``information operation'') Figure 3.1 presents a diagram of the filtering steps and includes information on how many papers were excluded in each step. Figure 3.1: Diagram of the exclusion and inclusion criteria application for the SLR In total, 26 papers were identified for review. The next section contains an aggregated analysis from all of them.

This survey constituted an enlightening research effort. It served as a structured way of answering the SRQs, to identify trends, new approaches, and systematic flaws in the research. This discussion focuses on clarifying each of the above points. < Systematic Literature Review (SLR)

Although the surveyed papers share some common nomenclature in terms of the types of features used, there is no one major classification system. As such, we are proposing a standard set of feature types that cover the main sources of features. It should be noted that a large subset of features can be calculated and engineered, and thus we focus on the goals of the features used, rather than on their original nature. The proposed feature types are:

Account

Includes features that can be read directly from the account metadata returned by the Application Programming Interface (API) like creation date, username, description

Content

Includes features derived from tweet content, like

Network

Includes features that describe accounts in terms of presence in the network like

Activity

Includes behavior metrics like frequency of posts, active hours, active days of the week, and

Others

This class serves to include custom metrics like political orientation labels, or labels taken from other tools. Table 3.1 (p. 19) contains an aggregated view of the feature classes used by each paper that handles these features, regardless of the goal.

Table 3.1: Comparison of feature types used per paper (excluding 3 papers with no feature use reported) 1 Although it makes little statistical sense to assume this sample is representative of the whole set of papers, we can report a few curious observations. First, even though account data is straightforward to obtain and use, it is not as used as content or network. Second, only four papers try to use all of the data types (excluding others); this might be a good indicator that the literature does not focus on using extensive amounts and sources of data.

We have identified several groups of techniques. These groups are agnostic to the goals of the studies. For instance, in [50], the authors used Decision Trees (a technique for Machine Learning (ML) classification or regression tasks) as a stepping stone to measure feature relevance. One of these groups – specialized tools – is dedicated to external tools and not techniques. • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • • Systematic Literature Review (SLR)

The identified groups are ML classifiers, data representation, community detection, sampling techniques, and specialized tools. Below is a list of each technique per group followed by the papers in the study that use them. This list excludes libraries and frameworks that are not relevant to the context of the study. Each group of techniques is sorted by decreasing number of papers that use them.

Machine Learning Classifiers • (7) Neural Networks [58] [39] [14] [34] [18] [17] [23] • (4) Random Forest [24] [10] [4] [17] • (4) Support Vector Machine (SVM) [14] [34] [4] [18] • (3) AdaBoost [46] [39] [4] • (3) Decision Trees [4] [18] [50] • (3) Naive Bayes [24] [4] [18] • (2) Logistic Regression [39] [66] • (2) Long short-term memory Neural Networks (LSTMs) [39] [54] • (2) Convolutional Neural Networks [14] [54] • (1) K-Nearest Neighbors [4] • (1) SGD classifier [39] • (1) Sequential Minimal Optimization [24] • (1) Logit Boost [17] • (1) Gradient Boosting [34] Data Representation • (5) Word2Vec/Dov2Vec [14] [25] [54] [66] [69] • (4) Latent Dirichlet Allocation (LDA) [46] [4] [23] [69] • (3) TF-IDF [14] [54] [66] • (2) GloVe [39] [66] • (1) BERT [14] • (1) Relief Algorithm [18] Community Detection • (4) Louvain clustering [65] [61] [25] [23] • (2) Label Propagation (LPA) [58] [25] • (1) Link Communities [25] • (1) Oslom [25] Sampling Techniques • (3) SMOTE [39] [34] [17] • (1) Edited Nearest Neighbours (ENN) [39] • (1) TOMEK [39] Specialized Tools • (4) DeBot [26] [57] [50] [54] • (3) Botometer/Botornot [7] [28] [50] • (2) BotWalk [26] [50] • (1) Sentimetrix [58] • (1) Sentistregnth [4] • (1) ORA Network Tool [65] All in all, we have the ML techniques group as being more prevalent than the others. This over-representation is explained by the fact that the majority of papers filtered focus on creating bot detection systems. It is also interesting to see that embedding-based techniques like Word2Vec, GloVe, and BERT are standard for semantic representation of text, but not for other types of representations. We have found that the main research lines are: performing data-oriented ad-hoc analyses for a given context, developing bot (or similar) classifiers. Others focus on using Social Network Analysis (SNA)-inspired approaches (like community detection), or on creating consensual taxonomies for future research.

The most valuable takeaway from this study is identifying a systematic fallacy in the literature. Most approaches either lack the eyesight to acknowledge the well-defined scope of their data and models, or lack the will to test it. The fallacy is (wrongly) assuming that bot behavior is not evolving and adaptable to detection systems when designing the very systems that aim to thwart bots. Some authors report dataset-specific conclusions like “bots generally retweet more often” [26 ] or “social spambots and fake followers are more inclined to post messages with a positive sentimental fingerprint” [

4] as eternal postulates. These are taken as absolute truths when they are perfectly susceptible to adaptive strategies taken by bot creators. Some authors recognize this limitation [69] [10]. In [28 ], the authors start by asking “whether the detection of single bot accounts is a necessary condition for identifying malicious, strategic attacks on public opinion, or if it is more productive to concentrate on detecting strategies?” and then follow through with suggesting a shift in the current approaches to bot detection. It is precisely this conclusion that we take from this SLR process. 4

Systematic Literature Review (SLR)

Furthermore, we believe that high-level approaches are detrimental for developing robust approaches to detect political interference on Twitter. The most promising ones focus on capturing the evolution of the Online Social Network (OSN) rather than on pinpointing automated accounts, these include the high-level ad-hoc analyses but also the community detection efforts such as the ones in [25] [23]. As such, the remaining of this work, will focus on testing different high-level approaches that focus on leveraging data features that are not susceptible to adaptive strategies, or at least at using the ones that seem less so.

Chapter 4

Problem Statement “Power is okay, and stupidity is usually harmless. Power and stupidity together are dangerous.”

Patrick Rothfuss in The Name of the Wind

Now that we have a good understanding of the context, the main approaches found in the literature, and their limitations, we can focus on defining the problem we will tackle and how to do so. Firstly, it is good to restate that approaches which leverage bot detection fall short of scalability and are highly susceptible to adaptive strategies. Secondly, we conclude that the problem of opinion manipulation on Twitter should be tackled from a high-level perspective, by taking a bird’s eye view of the system – leading us to Assumption 1: Assumption 1.

High-level approaches (to uncover political manipulation on Twitter) are less prone to irrelevant patterns and conclusions than fine-grain ones.

Thirdly, we have seen that many insights on how to detect bots, taken from previous studies, are quite clearly only relevant for a small amount of time. Such is due to the level of effortlessness that exists for bot creators to overcome them. Examples include the inconsistency of daily schedule activity with that of humans, lack of personal data, or even ones that require some more commitment like account age. Taking this into account, we establish Assumption 2: 23

Problem Statement

Assumption 2.

The best way to design scalable and durable methods (of manipulation detection on Twitter) is to use data features that are harder for malicious orchestrators to manipulate.

6 The insights from Chapter 3 (p. 15) lead to the conclusion that two things are hard to forge and manipulate. The first one is the content of what is said – if the goal is to change someone’s mind on a topic that topic needs to be covered. The second one is influence, presence or interaction – if the goal is to reach many users, or a particular group of users, it becomes unavoidable to establish a strong presence and reach in the network. Therefore we shall focus on using data that contains latent information on these two inescapable facets of manipulation in Online Social Networks (OSNs). All in all, the problem we investigate is assumed to require an approach that is both high-level and as hard to manipulate as possible. With these two pillars, we now have a properly defined foundation for understanding the origin of the hypothesis stated in Section 1.5 (p. 3): Do different high-level approaches to Twitter data analysis, when applied to distinct types of malicious activity, a) lead to meaningful results and b) agree among themselves?

To answer it, we will conduct three different exploration efforts: one content-oriented ( cf.

Section 6.3.1, p. 59), one metadata-oriented ( cf.

Section 6.3.2, p. 74), and one structure-oriented ( cf.

Section 6.3.3, p. 80). These are our high-level approaches that focus on using hard to manipulate data, with metadata representing the weakest of the three in terms of manipulability. Although we have already described the meaning of high-level in this context ( cf.

Section 1.5, p. 3), we have not introduced the meaning of distinct types of malicious activity. These distinct types correspond to groups of accounts that are considered malicious for distinct reasons. Section 6.2 (p. 52) contains a detailed description of each of these and how they are obtained.

Before we can answer our hypothesis, however, we need to understand each high-level approach individually. This is where we introduce the need for a preliminary study. This study consists in understanding what type of information we can extract from each approach, and how meaningful it is, essentially answering the first point of our hypothesis. Only then will we be able to ascertain the second point – the existence of agreement between the results of each approach. To achieve this goal, we will delve into understanding the information gathered, for each of the three approaches, in their respective sections; and then briefly recall this in their individual summary, in: Section 6.3.1 (p. 59), Section 6.3.2 (p. 74), and Section 6.3.3 (p. 80) corresponding to content-oriented, metadata-oriented, and structure-oriented, respectively. At this point, it becomes clear that the hypothesis stems naturally from the analyzed literature. It requires existing techniques like semantic content representation, embeddings, clustering, and other forms of representation. These techniques can easily be combined and tweaked to the specific context at hand. The existence of previously mentioned similar works and also parallel research efforts further support that this is neither a lost cause nor a finished one. Thus we argue that the presented hypothesis is plausible . Although no single approach found in the literature attempts to validate our hypothesis, many recent works touch subtasks that we shall have to go through in order to validate it. In terms of uncovering topics from textual content, one of the most standard approaches is Latent Dirichlet Allocation (LDA) [11] – a statistical model that was significantly used ( cf. Section 3.3.2, p. 20) and even referred to in the Systematic Literature Review (SLR) process. More recent works have been using word embeddings to uncover topics in streaming data [16] and even as a stepping stone for topic inference and correlation [35]. The authors of [55] propose an algorithm that is particularly relevant for the present case, namely due to its ability to combine content and structure, allowing us to measure the aforementioned duality of content and network influence. [23] shows another attempt at combining content and network information, especially interesting as they also use LDA for topic detection, but introduce a temporal notion into their model. Other approaches focus on performing community detection on custom-built graphs that already embed content and topology [61] [25]; this allows for more traditional algorithms to be used. Other alternatives include deep-learning-inspired Generative Adversarial Networks (GANs) [52] that can generate overlapping communities where each vertex has a strength that describes its level of membership towards a given community. In terms of evaluation, it is also relevant to consider the work of [25] that proposes a new metric to evaluate the Internal Semantic Similarity (ISS) of communities.

Along our work, we will complement each of our approaches’ choice with references to other related work that, for being related only to particular tasks and decisions, is found to be more beneficial when located at the moment it becomes relevant.

In order to validate the proposed hypothesis, we will use validation at intermediary steps in the process of developing Machine Learning (ML) models. We will gather Twitter data and enrich it with labeled examples obtained from different sources, in order to ensure our results are not susceptible to biased data. Most of our exploratory work will focus on comparing different highlevel strategies, and their results will reflect their potential viability in validating our hypothesis. In this sense, it is hard to establish one single metric that will reveal whether our exploration processes

Problem Statement works in different contexts. We find it relevant, however, to establish them nonetheless. Especially in the hopes that future work in our research line will continue revealing the feasibility 8 of high-level approaches that focus on content and interaction. One final aspect of our validation process is that we are collaborating with a peer working on a parallel problem – creating a troll classifier for the Portuguese context – and our results will be compared to theirs in an attempt to mitigate the difficulty in non-abstract validation methods. Even with respect to other works that focus on validating bot detection models, it remains hard to validate them, knowing it is commonly accepted they are validated against incomplete example sets. In fact, it is known [29] [46] that the accounts that Twitter suspends and blocks are found with a conservative mindset – their goal is to have high precision and not high recall. Overall, we will compare the results of our independent exploration efforts to ascertain if their results make sense when interpreted together. We will also rely on multiple sources of labeled data – one of which a peer working in a similar issue – to mitigate dangers of biased data. Finally, we will perform most of our data collection and then data exploration with a focus on ensuring that both data quality and the results reached are statistically significant and free from typical mistakes and biases found in similar works. The main goal of this work is to push the current state-of-the-art of detecting malicious political content on Twitter further, ideally by producing a visual tool – or a first draft at one – that can put the developed solution in the hands of the public, the media, and Twitter itself. In order to do so, we shall experiment with state-of-the-art tools for each of the sub-tasks in our work: data collection, data representation, temporal representation of data, topic detection, community detection, and visualization of the evolution of these topic-aware communities over time. This chapter has drawn a line on where state-of-the-art research is, relating to political opinion manipulation on Twitter, its issues, and potential that more theoretical lines of research contain. We have stated three assumptions that will support the investigation of the hypothesis. We have argued that this is a plausible hypothesis and how we shall validate it, even with the known constraints.

Chapter 5

Twitter Watch “Humans were always far better at inventing tools than using them wisely.” Yuval Noah Harari in

21 Lessons for the 21st Century

The first requirement for analyzing Twitter data is, well, having data. Although it is not uncommon for researchers to reuse already existing datasets for the validation of new approaches [14] [23], this option was not viable to us, as the most recent dataset focusing on the Portuguese Twitter context that we could find was from 2016. At the time of starting this work, we wanted to focus on the 2019 Portuguese legislative elections, occurring on October the 6 th , 2019. Since no data was available for immediate use, we had to collect it. This section reveals how we achieved this goal and also how and why we developed a new Twitter data collection framework – Twitter Watch.

As specified in Section 2.2.2 (p. 9), Twitter has an Application Programming Interface (API) that can be used for a multitude of purposes, we will focus solely on data collection for research purposes. Since our goal is to focus on a specific context – the Portuguese Twittersphere – and on a particular political event – the 2019 legislative elections – we needed to find a tool capable of extracting a relevant dataset to conduct our study. We did not find such a tool. Indeed, we listed the 27 Twitter Watch requirements that such a tool should fulfill in order to generate a relevant dataset of both users and tweets: Data Collection Requirements 5.1: Initial requirements 1.

Capture a specified period of time; 2.

Start from a list of relevant Twitter accounts and dynamically find new potentially relevant ones; 3.

Restrict the collected data to the Portuguese content as much as possible; 4.

Detect suspension of accounts as they occur; 5.

Properly explore the API limits, as that is a potential bottleneck; 6.

Save the data in a way that facilitates the subsequent analysis; 0 7.

Ensure a coherent structure and avoid redundancy; 8. (Optionally) Allow regular backup of dataset snapshots; 9. (Optionally) Notify failures in real-time; 10. (Optionally) Ensure a logging mechanism to follow both progress and failures during the collection process; 11. (Optionally) Provide a visual interface that facilitates monitoring the collection process; 12. (Optionally) Be adaptable to different collection goals through an easy configuration; 13. (Optionally) Allow flexibility in how the data collection evolves. With the above requirements in mind, we looked for tools that would either satisfy them or be flexible enough to accommodate them through easy changes. On one side, we have commercial tools like Hootsuite , Sysomos , or Brandwatch that are both commercial and abstract the access to the data, but focus on using the search endpoints by looking at hashtags or search terms, as explained in Section 2.2.2 (p. 9) this means only a seven-day window of past data is available. Although this work had some initial efforts of data collection surrounding the election period ( cf. Section 5.2, p. 29), the usage of such approaches focusing on endpoints and a very narrow window for that collection discouraged the use of both that endpoint as the sole source of data and of tools that relied heavily on it. These observations mean these tools fail many of the mandatory requirements, the most limiting being 1, 4, 6. On the other side, we have open-source tools like Socialbus , TCAT , sfm-twitter-harvester , or Twitter-ratings that are more oriented towards research data collection. These tools, however, are limited. Socialbus and TCAT are configured to filter tweets by users or by topics/keywords, but these need to be specified beforehand, and any dynamic evolution is relying on restarting the system, therefore not meeting requirement 2. Sfm-twitter-harvester can be interacted with as either a Representational State Transfer (REST) or streaming API, this means it is more flexible but lacks the persistence desired when building a dataset, it can actually be seen as an abstraction layer above https://hootsuite.com/ https://sysomos.com/ https://github.com/LIAAD/socialbus https://github.com/digitalmethodsinitiative/dmi-tcat https://github.com/gwu-libraries/sfm-twitter-harvester https://github.com/sidooms/Twitter-ratings the API, and fails to meet requirements like 2, 6. These tools are still found to be too generic and don’t add a lot of value for the current use case when compared to the API wrappers available like Tweepy and python-twitter . This initial desire to find a suitable tool was unmet, and the gap remained open. Before actually implementing a solution that would close it by fulfilling the above requirements, we hit some metaphorical walls that are described in the next section as a reference point for anyone interested in achieving a similar effort. Initial approaches were ad-hoc and, unsurprisingly, faulty. Even before starting to collect data, we focused on understanding Twitter API, its different response formats and objects , endpoints , rate limits and response codes . In the end, we also developed a simple open-source scraper that generates two JavaScript Object Notation (JSON) files containing error and response codes that can be used to better understand the interactions with the API. Then, having chosen python-twitter as the API wrapper to use for the collection process, we identified all the accounts from the Portuguese political context that fell into one of the following categories: • Political party account; • President of a political party. The final number of accounts found was 21, and this process was conducted in October 2019. Starting from this seed of accounts we designed a single page Jupyter notebook that underwent the following phases of: 1. (A) Get updated profile information on the seed accounts; 2. (B) Get all the followers of (A); 3. (C) Get all the followees of (A); https://github.com/tweepy/tweepy https://github.com/bear/python-twitter https://developer.twitter.com/en/docs/tweets/data-dictionary/overview/ intro-to-tweet-json https://developer.twitter.com/en/docs/tweets/search/api-reference https://developer.twitter.com/en/docs/basics/rate-limits https://developer.twitter.com/en/docs/basics/response-codes https://github.com/msramalho/twitter-response-codes https://github.com/bear/python-twitter

2 4. (D) Get the 25 most recent tweets from (A); 5. (E) Get the retweeters of (D); 6. (F) Get the retweets of (D); 7. (G) Get the 25 most recent tweets from (B); 8. (H) Get the retweeters of (G);

Twitter Watch (I) Get the retweets of (G); 10. (J) Get the 25 most recent tweets from (C); 11. (K) Get 10 tweets for every user in the database that did not have collected tweets; 12. (L) Calculate the 10,000 most common hashtags; 13. (M) Use the search API to get 200 tweets for each hashtag in (L). All the data was saved to a MongoDB instance, which has the out-of-the-box benefit of ensuring no duplicate documents exist , combining this with Twitter object model’s _id field, means that the redundancy requirement (7 in the requirements list) was easily achieved. We executed this script in a time window that encapsulated the October 6 th elections. Although this dataset is created coherently, there are a few subtle inherent limitations to the way the collection steps are designed. Firstly, by not having the full tweet history, we cannot conduct any analysis on how Twitter’s overall usage varied through time (on the Portuguese context). The same goes for analysis of each user’s usage patterns, and other analysis that require complete temporal data. Then, there is a hidden assumption that malicious activity will necessarily be within accounts that are either followers or followees of the seed accounts, or of the retweeters identified in (E) and (H). Then, (L) and (M) enrich and add variety, but their benefit is not exploited since no new accounts are expanded from those collected tweets. Also, and in line with a limitation common to all the open-source tools mentioned above, they did not allow for requirement 4 to be met, as the suspension of accounts was not easy to monitor or record. This approach was far from ideal to what we required. As the initial research focus was on combining textual content with structural information, we also went down another insidious path – attempting to save followers and followees of every account in the MongoDB instance. This proved hard due to quickly reaching MongoDB’s maximum document size of 16MB . Working around this was not advisable . A few options were considered, but we ended up going for something outside our comfort zone – Neo4J – a graph database designed specifically to save relationships between entities. After setting up a Neo4J Docker instance, we started using the old collection process with a few code changes that would ensure the follower/followee relationships would be saved. We had some success, as is visible in Figure 5.1 (p. 31), we were able to capture accounts, and their follow https://docs.mongodb.com/manual/reference/limits/ https://jira.mongodb.org/browse/SERVER-23482

3 relationships. However, this proved to be yet another dead end when we observed that the database- writes became the bottleneck of the process, and not Twitter’s API, no tuning or batch writing solved this problem.

Figure 5.1: Example visualization of Twitter "follow relationships" saved in Neo4j After having spent a significant amount of time on struggles in the collection process, we took a step back and decided to focus on designing a more deliberate system, even if at a greater time cost, which could answer the requirements above, as well as ensure two new ones: Data Collection Requirements 5.2: Additional requirements 1.

Separate watched from non-watched accounts, the first type consisting of accounts with content posted in Portuguese and ideally within Portugal’s Twittersphere; For every account that was marked as being watched, all their tweet history should be collected. This new system was dubbed

Twitter Watch . The next section introduces the architecture designed to answer Requirements 5.1 and 5.2.

Twitter Watch’s high -level architecture can be split into User Interface (UI), backend, and external services. Figure 5.2 (p. 32) contains a visual representation of this architecture, its components, and their interaction. 5

The UI is a pragmatic effort to streamline the management of the data collection process through an easier and faster diagnosis of the system’s state. The UI itself is agnostic to the which Twitter Watch backend it is connecting to and accepts as input the Internet Protocol (IP) address of any Twitter Watch API instance. The most updated version of the UI in production is available at msramalho.github.io/twitter-watch. The UI is hosted on GitHub Pages , it was developed with Nuxt.js and Vuetify . In terms of content, the current version has two main pages: statistics and logs. The statistics page, as seen in Figure 5.3, contains plots of how the number of users and tweets recorded in the database evolve over time, as well as a plot of the database size evolution through time. Additionally, it contains some overall statistics like the current number of users and tweets, and can easily be expanded to accommodate more information. Figure 5.3: Twitter Watch UI statistics page The logs page, as seen in Figure 5.4 (p. 34), has two main sections. On the left, there is a panel to explore all the different scripts that are run by the backend, as specified in Section 5.4 (p. 34). Currently running scripts are marked with a green dot that, when hovered, displays information on how long they have been executing. Furthermore, expanding any of the scripts reveals a list of the individual logs for the date and time their execution started. When one of these logs is clicked by the user, its output is fetched and displayed on the right panel ( cf. Figure 5.4, p. 34). 34

Twitter Watch https://pages.github.com/ https://nuxtjs.org/ https://vuetifyjs.com/en/ Figure 5.4: Twitter Watch UI logs page

Twitter Watch interacts with external services each with a particular goal. The most obvious one is the Twitter API, a quintessential part of Twitter Watch, where all the data comes from. On top of that, Twitter Watch uses Google Drive API for storing regular snapshots of the database in case there is a problem with the deployment. In practice, these snapshots are used for local data analysis as they constitute an easy way to download the compacted data and rebuild the database in any local deployment of MongoDB. Finally, out of necessity, we found that it is useful to have a way to receive notifications about unexpected collection errors. To achieve this, we used Pushbullet , a free push notification service that has a mobile client that can receive real-time push notifications. Push notifications were mostly used to detect errors in the most fragile processes like interaction with the Twitter API, building the database compressed snapshot, or uploading it to Google Drive. The logic behind this push notifications mechanism is isolated in the core container code and can easily be invoked anywhere else in the application, where any future user of Twitter Watch needs. Twitter Watch’s implementation rests on a scheduling system that is cap able of launching parallel processes, each with its logic, and the combination of individual tasks interacts indirectly, as these get their input from the database and write their output on it too – a holistic system. Although the system is designed to be both flexible and customizable, the out-of-the-box version is already capable of meeting all the desired requirements. Part of the flexibility comes from a configuration file used to dictate how the dataset should evolve over time, this configuration file is further explained in Section 5.4.1 (p. 36). The rest of the flexibility stems from the creation of a semantic folder structure along with the ability to add new features without having to change any core code. This last point is achieved https://developers.google.com/drive/

37 by isolating all the collection process logic into Jupyter notebooks, which are automatically interpreted according to their location in the aforementioned semantic folder structure. Figure 5.6 (p. 36) contains a high-level view of the backbone of Twitter Watch. It shows four different execution steps: launch, setup, run-once, and scheduled tasks. Each task is written in its own Jupyter notebook. The key to differentiating them is their location in the folder structure, namely the one visible in Figure 5.5. Each folder in the collection folder has a specific execution routine that maps directly into the implementation ( cf.

Figure 5.6, p. 36), as is described below. Figure 5.5: Jupyter notebooks semantic folder structure Initially, when the application is launched, there is a single line of execution. First, the configuration file is parsed and validated; then, the output folder structure is created if it does not exist; lastly, the Jupyter notebooks, where the collection logic is written, are converted into Python ( . py ) script files, as these can then be easily executed in separate processes. Afterward, the now converted Python script files inside the setup folder are executed. In our implementation, this step has tasks related to inserting an initial set of accounts into the database for subsequent exploration (these seed accounts come from the configuration file), and executing database migrations (these can be anything from creating indexes on the database to restructuring the database and can vary through time). These setup tasks are executed as parallel threads. Once all the setup tasks have completed, both the run-once tasks and the scheduled tasks launcher loop are executed in parallel. The run-once tasks differ from setup tasks by being able to co-exist with the recurrent tasks launched by the scheduled launcher. These tasks are often tasks that only need to be executed once, or that will be running non-stop until the application is manually closed. One-time data migrations fall into this category. In our case, we had a task running non-stop that simply logs the number of documents in each database collection at a custom interval. Finally, the scheduled launcher is the piece of the puzzle responsible for parsing the filenames inside the daily and hourly tasks and running them at the specified time. Note that other schedules can be added in the future, like weekly executions, with little effort due to the abstraction mechanisms implemented. Note also that the filenames of the files within the daily and hourly folders are used to infer the exact desired execution time. For instance, the filename { "seed": { "usernames": [] }, "collection": { "limits": { "max_watched_users": 100000000, "max_daily_increase": 25000, "max_daily_increase_ratio": 0.1, "min_appearances_before_watched": 10 }, "ignore_tweet_media": false, "oldest_tweet": "Sun Aug 1 00:00:00 +0000 2019", "newest_tweet": "Sun Aug 1 00:00:00 +0000 2030", "search_languages": ["pt", "und"], "max_threads": 8, "min_tweets_before_restricting_by_language": 10 }, "mongodb": { "address": "mongodb://USERNAME:PASSWORD@mongo:27017/", "database": "twitter", "drive_api_backup_enabled": true }, "notifications": { "pushbullet_token": "API TOKEN" }, "database_stats_file": "out/db_logs.csv", "seconds_between_db_stats_log": 10, "api_keys": "TWITTER API KEYS FILE" } Listing 1: Simplified example of the JSON configuration file Below is an explanation of the most relevant fields in the configuration file: • The seed field contains usernames of the seed accounts that serve as a starting point for the collection process; • The collection field specifies what the behavior of the collection process should be like, for instance: ◦ limits is a set of restrictions on the total size of the dataset and how it can grow in size on each day; ◦ oldest_tweet and newest_tweet restrict the time span during which Tweets are to be collected; 0 Twitter Watch ◦ search_languages is a list of language codes used to restrict tweet collection on accounts with more than min_tweets_before_restricting_by_language tweets, while not having a majority of them in one of the provided languages; ◦ max_threads is a global restriction on the number of threads each process uses when is parallelized through the class described in Section 5.5 (p. 41). • The mongodb field and its inner fields indicate the database access credentials, the default database name to use, and whether to perform this database backup to Google Drive ( cf.

Section 5.3.3, p. 34) or not; • The notifications field is used to provide the API credentials for Pushbullet ( cf.

Section 5.3.3, p. 34); • database_stats_file and seconds_between_db_stats_log specify the location and period of the database statistics collection process that is always running in the background; • api_keys is simply the filepath of a JSON file where a list of our Twitter API credentials are stored. This configuration file can quickly grow to meet the developer’s end -goals since adding a field in the JSON file will make it immediately available in any of the Jupyter notebooks. With the above knowledge, we can now delve into explaining how the different collection tasks work. A first remark has to do with making the most out of Twitter API keys, namely by isolating each used endpoint in its task, since the rate limits apply at the endpoint level. Knowing this, it should also be noted that the functionality was designed to be as modular as possible. The next subsections reveal the way the current implementation extracts Twitter data by going into detail on the most relevant implemented Jupyter notebooks and respective tasks.

Seed followers (daily)

This task runs daily and iterates over all the seed accounts specified in the configuration file updating the database with any new followers of those accounts by querying the GET followers/ids endpoint. All these accounts are marked as watched accounts.

Seed friends (daily)

This task is in all aspects similar to the previous one, differing only by focusing on friends ( a.k.a. followees) instead of followers, through the use of the GET friends/ids endpoint. https://developer.twitter.com/en/docs/developer-utilities/supported-languages/ api-reference/get-help-languages Account details (daily)

This task runs daily and iterates over all the accounts without a screen name property. This happens because accounts are added to the database by other tasks whenever a new account is found, but they typically contain a single _id property and not the complete profile information. It uses the GET users/lookup endpoint to hydrate the account objects. Appendix A (p. 105) contains an example of an object returned by this endpoint.

Tweet collection (daily)

This task runs daily and iterates over all the accounts that are marked as watched, or accounts that have not yet been excluded due to the restriction imposed by the parameter search_languages ( cf. Section 5.4.1, p. 36)). The logic behind filtering out accounts is focused on producing results, i.e. accounts and tweets that are relevant for the subsequent study’s goals. In this case, the focus is on restricting by proximity to the seed accounts and also to a set of specified languages. So, all the aforementioned accounts are then iterated and all their tweets are collected between the oldest_tweet and newest_tweet values, using the GET statuses/user_timeline endpoint, up to a limit of 3,200 tweets imposed by Twitter ( cf. documentation ). In fact, this task is optimized as every time an account’s tweets are collected, the _id of the last collected tweet is saved to the database and use in future calls as the since_id param. This is in accordance with the official optimization guidelines to minimize redundant API calls . Finally, this task also updates the most_common_language of each account by pre-processing its collected tweets, this property is used to restrict further iterations of the task from processing users whose most_common_language is not in search_languages , note that most_common_language is only set if an account has at least min_tweets_before_restricting_by_language tweets. Favorites collection (daily)

This task is similar in behavior to the previous one, with two differences. First, it collects liked tweets instead of posted ones, which is achieved through the GET favorites/list endpoint. Second, it restricts the iterated accounts to watched users only.

Tweet processing (daily) https://developer.twitter.com/en/docs/tweets/timelines/api-reference/ get-statuses-user_timeline https://developer.twitter.com/en/docs/tweets/timelines/guides/ working-with-timelines Twitter Watch

This task runs daily. It does not rely on making API calls. It iterates over all the unprocessed tweets in the database, so that each tweet is only processed once. For each tweet it isolates the ids of all the accounts that are related to that tweet: • all mentioned accounts; • the author of the tweet; • the author of the original tweet, if this is a retweet;• the author of the original tweet, if this is a quoted tweet; • the author of the original tweet, if this is a reply tweet. Each of these account ids is inserted in the database if it does not exist yet and the appearances counter for each account is incremented by one. This counter is used as a minimum threshold to start including the respective account into the watched users set; this behavior is configured by the min_appearances_before_watched field in the configuration file. Larger values of that parameter will lead to a slower yet potentially more relevant expansion of the watched users set. Seed tweet processing (daily)

This task is similar to the previous one. However, it is focused only on tweets by the seed accounts defined in the configuration file and the accounts identified are directly marked as watched, due to their proximity to the seed account and expected relevance to the dataset.

Add new watched users (daily)

The goal of this task is to iterate all users that are yet to be marked as watched or non-watched and insert the ones that meet the minimum number of appearances count as defined by the configuration field min_appearances_before_watched . Furthermore, the configuration field limits and its inner values will restrict the maximum number of newly watched users per day and, in extremis , will prevent the addition of any more watched users if the max_watched_users value has been reached. These configurations are meant to coerce the system to evolve more slowly to avoid resource overload, when that is necessary.

Hashtag tweet collection (hourly)

This task is executed every hour. First, it collects all the tweets in the database from the past 24h originating from the seed accounts. Then, the hashtags of those tweets are gathered and merged, and are then used to perform a language-restricted search (according to search_languages ), for each unique hashtag. We use the Standard search API, noting that this is a unique endpoint where the results are restricted to a 1% sample of all tweets in the Twittersphere of the past seven days. The end-goal of this task is to introduce some relevant variety to the collection process since these tweets will later be processed and influence the expansion process. Google Drive backup (daily)

This task has been mainly explained in Section 5.3.3 (p. 34) and it takes care of calling the mongodump command from MongoDB and then uploading the resulting database snapshot to Google Drive. https://docs.mongodb.com/manual/reference/program/mongodump/ Twitter Watch

Note on suspensions

It is important to highlight that on every API call that operates on account ids, namely looking up followers, friends, account information, tweet timeline, among others, Twitter Watch has a mechanism that wraps the errors returned by the API and, if a received error is related to the given account having been suspended (deletions, and private accounts are also recorded, for that matter), the database is updated to register this occurrence and its timestamp. This is useful for enriching the dataset with suspensions information.

Note on long-running tasks

The current version of Twitter Watch ensures that any scheduled task set to start its execution at a given moment is only launched if its previously launched instance (1 hour before for hourly tasks, and 24 hours before for daily tasks) has finished, in order to avoid excessive resource consumption.

After having Twitter Watch collecting data for a while, we noticed that the size of the database was large, and some of the tasks, such as the daily tweet collection or tweet processing, were taking more than the ideal 24 hours threshold. Inspired by the Map-Reduce algorithm [19], we developed our own implementation of a parallel processing mechanism that each task could benefit from. This mechanism requires only isolating the logic code in each notebook to a def task(skip, limit): method that performs the same query on the MongoDB database but appends .skip(skip).limit(limit) filters to the query. Once this change is ready, all a developer has to do is invoke the .run() method on the

DynamicParallelism(..., batch_size, max_threads) class. The batch_size and max_threads properties can be used to adapt the behavior of the parallel execution both in terms of the size of each batch of database documents to be processed as well as on the number of threads to use, respectively. The number of threads will default to the configuration field max_threads . A .reduce() method exists to merge all the outputs, but in practice, it has only been used for data analysis tasks and not actually in the collection process. In the end, this effort managed to significantly reduce the long- running tasks’ length to under the desired

24h limit.

First of all, the framework resulting from this effort was able to meet both the mandatory as well as the optional requirements specified in Requirements 5.1 and 5.2. Second, the development of Twitter Watch was incremental, since such a complex and byzantine system was impossible to effectively tune in advance while assuring few wasted resources. This led us 5 to give priority to things as they came up. In any case, we believe it served its purpose for our use case, and has led us to be hopeful about its potential to be adapted or used as-is for other research or industry efforts. In terms of results, we can look at its overall success as a consequence of both data quantity and quality it managed to collect. Although this section will focus mostly on quantity, the exploration Chapter 6 (p. 47) will inherently depend on the data quality, and we will let the results described there echo this notion quality. With that in mind, this section reports quantity results for the system where it is deployed, and the configurations used.

Deployment server

The current deployment server has an x86_64 architecture running Ubuntu 18.04.3 LTS; having 6 Central Processing Units (CPUs), with 8GB of Random Access Memory (RAM), 236GB of Solid State Drive (SSD) disk space, and running Docker version 19.03.11. The initial setup included only 2 CPUs and 4GB of RAM; this was considered a bit limited, especially when interacting with the machine through Secure Shell (SSH) while the collection process was under execution. The final setup proved to be sufficient, but no doubt that increasing processing power and available RAM could significantly speed up some operations such as database read/writes that use indexes, which can require a lot of memory and also benefit from available processing power. In fact, special care was taken to spread out the tasks that could compete heavily for the same resources, but this was done through trial and error.

Configurations

Although our initial approach described in Section 5.2 (p. 29) used 21 accounts as seed for the data collection process, we decided to enlarge it by taking into account all the accounts that fall under one of the categories in the following list that expands the original one: • Political party account; • Government ministry account; • President of a political party; • Minister in office; • Secretary of state in office; • Parliament deputy. The final number of accounts found is 101. These accounts match with the seed.usernames parameter in the configuration file. This seed identification process was conducted in February 2020. Other configurations match the values presented in Code Listing 1 (p. 37) excluding, of course, the placeholder values which do not influence the logic of the collection process, like the database credentials or the API tokens. 6

Twitter Watch

Output

At the time of writing this report, the system is still up, and will remain so for some time. For a period of little over two months, from April 15 th th Twitter Watch collection. In contrast, the following iterations for those accounts will only yield the most recent tweets, since the day of the previous collection. The flattening of the curve, although related, at the same time, to the bottleneck that was fixed by introducing parallel processing, was not surprising. Since the collection process focuses on quickly including all the accounts that are close to the seed accounts (and restricted by the search_languages field too), and then it will only increase when new accounts are detected in the tweet processing task. These tweets come from accounts that interact with the accounts already in the database but also from the hashtag tweet collection task. Hypothetically, we could have defined a threshold for the total amount of collected accounts, and the system would keep on collecting tweets for that fixed number of accounts – this could be a strategy to fight an explosion of the number of accounts. Experienced limitations

A final note on Figure 5.7 (p. 43) to explain some slight decreases in both curves. For our use case, we defined a few run-once tasks that would remove some accounts and their tweets from the database, and that explains some slight decreases in the curve. This is due to the first limitation of Twitter Watch, the fact that it relies highly on the language of the tweets to find relevant content. This is undesirable because, for instance, the English language is widely used, and it is hard to capture realities in English speaking countries without incurring the risk of having a lot of noise in the dataset since it will eventually expand to capture tweets in the same language but from other countries. For our disgruntlement, Portuguese is the official language of Portugal and several other countries, most relevant in practice, of Brazil. Actually, the official language is Brazilian Portuguese, but Twitter marks both types as "pt" . Having seen a non-negligible number of Brazilian Portuguese tweets in our data, we devised a task focused on identifying accounts with a location in Brazil and removing them. This was partially effective, but not entirely as many accounts do not specify their location and, therefore, remain in the dataset. Despite this, we were able to perform data exploration tasks ( cf.

Chapter 6, p. 47) in a way where it was easy to check if Brazilian accounts were influencing a given result, namely by looking at random tweet samples and manually looking for Brazilian Portuguese content, or at least, Brazilian-related content. Other Portuguese speaking countries hardly ever came up throughout the collection and following exploration processes.

The above sections have laid out the current standpoint of Twitter Watch, and how it is already in a version that allows for massive and structured data collection that satisfies the desired requirements. However, we envision several improvements in terms of usability and the collection process.

In terms of usability, we have thought of adding control functionality to Twitter Watch’s interface like the ability to launch or kill a given task, or even edit the configurations file on the fly. In terms of the collection process, we can highlight that the system’s scalability is limited to a vertical growth of resources, and we believe that larger collection processes might benefit from a horizontal approach, for instance, by using MongoDB’s sharding mechanisms to benefit from a multi-cluster system. In terms of already verified limitations, the language limitation could be mitigated by adding a filter for the watched accounts based on the location of the account. In fact, if the location is not explicitly provided, one can even rely o n existing research to infer an account’s location from its posted content [44]. Finally, we believe that Twitter Watch’s approach can be adapted to other data collection processes and APIs. Overall, we have taken a top-down approach to explain the inner workings of Twitter Watch. However, we believe that anyone using it from scratch will still need to spend a couple of days getting familiar with its overall structure if they intend to develop custom tasks. Even so, we have seen it working and are satisfied with the obtained results that enabled our analyses that would otherwise be limited to, and based on, an incoherent and incomplete dataset. https://docs.mongodb.com/manual/sharding/ Twitter Watch

Chapter 6

Exploration “You’ll never stumble upon the unexpected if you stick only to the familiar.”

Ed Catmull in Creativity, Inc.

This chapter, although only a piece of the entire developed work, represents our efforts of actually investigating the potential of using high-level methods for the detection of malicious behavior on Twitter. We start by describing the dataset we explored, a snapshot taken from Twitter Watch. As it is hardly avoidable, we also highlight several meaningful changes on Twitter behavior due to the Covid-19 [15] pandemic. Then, we report how we faced the challenge of finding labeled data and its importance. Finally, we get to explore the dataset with three complementary lines of focus: content, metadata and structure, from where we draw our conclusions and strive to open new research-worthy questions for the future.

The dataset studied in this chapter is a snapshot of Twitter Watch taken on May 18 th , 2020. However, we trimmed the snapshot to exclude tweets collected after May 15 th since these had only been partially collected. Indeed, this type of limitation on the collection process was later fixed, as is reported in Section 5.5 (p. 41). 47 50 Exploration

Table 6.1: Average total tweets per day of the week between August 8 th st pvalue < .

05 for Welsh’s test, or 0 otherwise, under the null hypothesis that each pair of days of the week have the same average of total daily tweets. The conclusion is that, during this period, we have weekends consistently distinct from all other weekdays, except for the Sunday-Friday and Sunday-Monday pairs, where this is not a statistically significant claim. Such a pattern was unexpected since weekdays typically mean less free time to engage in social activities, this was not the case, suggesting that weekends might represent, for some people, time to disengage from social media. Another explanation could be that the increase in weekdays activity is related to media accounts being themselves more active and generating more novel content for commentary. In any case, an initial temptation could be to use this knowledge as yet another feature in the task of troll detection, but going back to the warnings issued in Section 3.3 (p. 18) – any human behavior that can be automated becomes irrelevant the moment it is discovered . 8 Exploration

Concomitantly, to the present research effort, another

MSc student at the University of Porto, Tiago Lacerda, is developing a thesis that focuses on creating a classification model to identify Portuguese trolls [41]. In a mutual collaboration effort, we have decided to share information and results. In our case, we have received a list of 287 accounts labeled as trolls, of which 278 (96.9%) were already in our watched users set. These troll accounts are obtained in two ways. The first, manual labeling, this approach consists of manually assigning a score to accounts and consider the ones above a score threshold as trolls; this method yielded 237 trolls. The remaining 50 accounts are obtained by a model trained using the manually labeled trolls. For readability reasons, whenever this particular type of labeled accounts is used in the rest of the document, it will be referred to as peer . With the established knowledge that fake news is a real problem in today’s society and Online

Social Networks (OSNs) in particular. Especially, as fake news work as a catalyzer for political manipulation, namely in the fabrication, manipulation and propaganda categories ( cf.

Section 2.3.2, p. 11), we considered that capturing their dispersion would be relevant to our analysis. Appendix B (p. 107) contains a set of initial fake news websites, totaling 32, found through manual investigation, along with the four sources where they were taken from. We tried a new approach consisting of using those websites to uncover other fake news websites, as is detailed further in this section. Using the expanded_url associated with shared Uniform Resource Locators (URLs) in

Twitter’s tweet object, we can iterate the millions of tweets in our dataset to look for tweets, and their posters, that share links to fake news websites – in any tweet form: tweets, retweets, replies, and quotes. One unfortunate limitation is that some of the identified fake news pages are Facebook pages. We have decided to ignore these cases, as Facebook’s URLs are not easily mapped from the accounts page, for instance, a URL may be posted by a given page and have no identification back to that page depending on how it is viewed, and consequently shared. Because of this, 11 URLs were excluded, meaning the 32 websites were reduced to 21. After processing our data looking for the remaining fake news tweets, we were able to identify the most predominant of these websites. Figure 6.7 (p. 55) shows the different predominance for each of the websites that were found to have been tweeted. We found that, out of this pool of websites, a small amount of them have a very high presence namely lusopt.eu (530), noticiasviriato.pt (501), noticiasdem3rda.com (390), and bombeiros24.pt (259) https://noticiasdem3rda.com/ Exploration explain 88% of the total shares. In total, 1,898 shares were found associated with 715 different accounts. 61

Exploration { '_id': 1241507379742638080, 'full_text': ' → ֒ → ֒ → ֒ → ֒ → ֒ } Listing 2: Example tweet by @DavidMagarreiro Having had success in identifying fake news posters, we decided to use the existing data in search of other fake news websites. To achieve this, we looked at the 200 most shared websites from the top 50 fake news posting accounts. From these 200, we excluded the safe websites (youtube.com and instagram.com are two such examples, but many were also from Portuguese newspapers). Figure 6.9 (p. 57) shows the top posted websites by those accounts that were not immediately discarded for being well known, ordered by decreasing number of shares, we have also added a focus on distinguishing those websites that were part of the initial list of fake news websites. Here, we identified, using our knowledge and through manual verification of the existence However, we noticed that when this website is shared on Twitter, it looks very much like a fake news article, as each article’s title gets the majority of screen space. Indeed, we calculated the conditional probability of one of those accounts publishing news from this site ( inimigo ) knowing that they published one of the others ( others ): P ( inimigo T others ) P ( inimigo | others )= P ( others ) = = . = . This value is similar to those of other websites, whose average is 0 . All in all, the work reported in this section led to the identification of three types of different accounts. However, later in this report, in Section 6.3.1 (p. 59), we further identify 17 more troll accounts. The process through which that happens belongs in that section but we include those labeled accounts in Table 6.2 under the name

LDA found for completion purposes. Table 6.2: Types of labeled accounts and their presence in the watched accounts set Name Total Watched % in Watched Suspended 94,447 10,478 11.1 Peer 287 278 96.9 Fake news posters 2,765 2,633 95.2 LDA found 17 17 100

Having a well-described dataset along with a varied amount of labeled accounts, we now delve into investigating the type of approaches that can be used to perform high-level detection of malicious political behavior. The following work is exploratory in nature and could have taken many different contours. That being said, we focused on three main research lines: analyzing posted content, posted metadata, and interaction structure. With that in mind, we first present a Latent Dirichlet Allocation (LDA)-based content-oriented approach in Section 6.3.1 (p. 59) followed, in Section 6.3.2 (p. 74), by a briefer hashtag exploration, and finally an interaction-oriented approach with a focus on modeling communities of accounts by clustering an embedded network, in Section 6.3.3 (p. 80). These choices point back to the main types of approaches used in the literature, as demonstrated in Section 3.3.1 (p. 18).

59 4

Exploration

LDA is an unsupervised Machine Learning (ML) technique with typical applications in topic modeling. Topic modeling stands out as a very common approach to analyze Twitter content, especially when trying to find malicious activity starting from topic analysis [4] [23] [69] . Because of this, and its continued success as a state-of-the-art technique ( cf.

Section 4.3, p. 25), we now focus on using it on our Twitter data.

Literature shows that LDA typically works better when trained on larger documents than the typical tweet size. In other words, LDA produces worse results for microblogging [45]. Naturally, different approaches for constructing the documents fed into LDA have surfaced, and we felt the need to avoid the vanilla approach to pooling documents – using tweets as documents. [3] identifies four main ways of building documents suitable for LDA with Twitter data, namely: Tweet pooling each tweet is one document – vanilla approach; User pooling each document is composed of the whole history of a user – better than tweet pooling but limited to uses with small time-windows; Hashtag pooling all tweets with the same hashtag compose a document - tweets with multiple hashtags appear more than once, tweets without hashtags are either discarded or included as tweet pooling documents;

Conversation pooling a document is a tweet, along with the cascade (upwards or downwards) of answers, replies, and comments associated with it - yielded the best results in [3]. In [45] this list is expanded to include:

Burst-score pooling involves running a burst detection algorithm to detect trending terms, followed by grouping tweets with trending topics (typically hashtags), according to a burst-score;

Temporal pooling consists of pooling all the documents within a short time frame, especially when unexpected major events, confined in time, happen. The last two methods, by their exceptional nature, were not found to be a good fit for our use case. So, after some ad-hoc testing of the expectable document sizes for the previous approaches, we settled down for a combination of conversation pooling, hashtag pooling, and tweet pooling for the tweets that were not used by any of the two previous methods. However, we decided to introduce time as a variable to determine the included documents. This was achieved by performing pooling on a daily basis. Therefore our pooling approach consists of gathering documents for each day, for instance, for a given day d in the analyzed period, and then we perform: Conversation pooling

By gathering all the documents in day d that are either replies or quotes ( a.k.a. commented retweets as retweets add no new text content) and traversing their conversation tree backward. This traversing process stops when the root tweet is found or when the next parent tweet is older than d − Hashtag pooling

By considering only original tweets (no retweets), posted in day d and in Portuguese "pt" . These tweets are clustered by hashtag usage. If a tweet has h hashtags, its text belongs to h documents; Tweet pooling

By considering the tweets from the previous step – hashtag pooling – that had no hashtags. Following the pooling process for a given day, we clean the text of the documents by: • Removing URLs, numbers, mentions, hashtags, punctuation, and emojis; • Excluding stop words in Portuguese, English, Spanish, Italian, and French; • lowercasing all text; • Stemming words.

We tested two libraries for the LDA model training: Sklearn and Gensim . Both these libraries implement an online ( a.k.a. incremental) version of LDA, as originally proposed in [33]. This implementation was chosen for three reasons: it provides a faster training speed than the original implementation; it can be trained on each day’s data and then evaluated – we used this during our initial experiments for comparing libraries; and, being online, can easily be extended to be updated in real-time – an advantage that we do not explore but consider for future work. Practice showed that Gensim was about ten times slower than Sklearn for the same task, so we rely on Sklearn’s implementation for the rest of this report. We focused on training an LDA model for the elections time period as defined in Section 6.1 (p. 47), ranging from September 6 th to November 6 th , 2019. The online version of the LDA training algorithm has a different set of hyperparameters from the original implementation, we focused on tuning the following (in Sklearn nomenclature): doc_topic_prior prior of document topic distribution; topic_word_prior prior of topic word distribution; batch_size number of documents to use in each iteration; learn_decay controls the learning rate update; learning_offset (positive) value that downweights early iterations in online learning. https://scikit-learn.org/stable/modules/generated/sklearn.decomposition. LatentDirichletAllocation.html https://radimrehurek.com/gensim/models/ldamodel.html Exploration

Table 6.3: Number of accounts per account type – separating regular from malicious account types – for LDA dataset Account Type a.k.a. non-regular) 2,334 2.22 Considering the number of data-points, and the number of features in our vector – in this case, the number of topics (64), we chose K-means – actually K-means++ [5], a slightly modified K-means version – as the clustering algorithm. This was not our first choice, since we believed a hierarchical clustering approach could allow for an analysis at different clustering levels but the size of the dataset made many choices of algorithms – hierarchical clustering in particular – unfeasible. This infeasibility was first observed by the fact that experiments were taking exponential running times as the size of data-points used grew. This was also confirmed by an empiral benchmarking study of runtime for different clustering algorithms implemented in Python. Since K-means ended up yielding interesting results ( cf. Chapter 7, p. 89) there was no need to test other algorithms. Having settled for K-means, we started by training models for a variable number of clusters in order to identify the best number of clusters to choose for the final model. Figure 6.21 (p. 71) shows the evolution of the final model inertia according to the number of clusters. In this case, inertia is the sum of distances of all the data-points to their closest cluster centroid. A greater number of clusters is expected to reflect in a smaller inertia value. In fact, when the number of clusters matches the number of data-points this values is, naturally, zero. The elbow method is a technique for identifying the ideal number of clusters by looking for the inflection in the inertia vs. number of clusters curve. Taking this into consideration we settled on using a K-means model with 64 clusters. https://hdbscan.readthedocs.io/en/latest/performance_and_scalability.html Exploration metrics include the id of the cluster (“Cluster”); the number of accounts of the given type in the given cluster (“Count”); the same value as a percentage (“Count (%)”); the cumulative sum of this percentage (“Cum. (%)”); the absolute percentage difference between the given cluster size ratio (for the given account type) and the total cluster size ratio (“% Delta”); and finally, the index at which this cluster is at when sorting a given account type’s clusters by their delta (“Delta Index”). Since the regular account type represents 97.78% of the accounts, it is expected that the cluster distribution be very similar to its own distribution. Indeed, this results in minimal delta values for the regular class. Table 6.4: Metrics for the top 3 clusters with the biggest amount (count) of each account type Cluster Count Count (%) Cum. (%) % Delta Delta Index Type Biggest regular 1 st nd

31 9 12,962 8,806 12.63 8.58 12.63 21.21 0.18 0.05 2 nd th rd

35 6,591 6.42 27.63 0.59 1 st suspended 1 st nd

31 9 19 16 14.18 11.94 14.18 26.12 1.48 3.51 15 th st rd

7 7 5.22 31.34 2.24 5 th peer 1 st nd

35 31 59 23 39.33 15.33 39.33 54.66 32.33 2.53 1 st th rd

55 9 6.00 60.66 3.85 3 rd fakenews 1 st nd

35 31 733 442 34.97 21.09 34.97 56.06 27.16 8.49 1 st nd rd

39 174 8.30 64.36 4.29 3 rd lda 1 st nd

35 31 8 6 47.06 35.29 47.06 82.35 40.05 22.49 1 st nd rd

39 2 11.76 94.11 7.60 4 th This table further confirms what becomes apparent in the cluster visualization figure – that the account types peer , fakenews , lda are highly similar in terms of their cluster distribution, even when considering the overall distribution over the clusters. They share the top two biggest clusters (35, 31) with a large delta value at least for the first one (35) that are 32%, 27%, and 40%, respectively. Additionally, more than half of each of these account types are in those two largest clusters, 35 and 31. The suspended accounts, in turn, have a distribution that is not too distinct from the regular accounts, although we can see that their presence has the largest delta for cluster 9, unlike the other types of malicious accounts. 8 Exploration

Looking at some of these clusters allows us to understand the topical communities that those accounts form. Cluster 39, for instance, is mostly associated with accounts that discuss football. 0

Exploration we believe topical activity analysis met our expectations in terms of both allowing to perform a high level exploration of how different topics are being discussed on Twitter, on identifying political topics and their variation through time, on identifying the accounts that contribute the most for a given topic, and on how groups of topical accounts can be identified and associated with our labeled data-points. All of this with the adjuvant that our LDA pooling strategy proved sound enough to lead to the above-reported results.

In this section, we focused on using LDA for analyzing our dataset from a content perspective. To achieve this, we defined a strategy for performing document pooling that combined conversation pooling, hashtag pooling, and tweet pooling. We then tuned our topic model and checked how it performed during the period for which it was trained. We manually labeled each of the 64 topics in our model grouping them into politics, international politics, misc, and junk. Focusing on the political topics, we confirmed that the political topics had spikes in discussion focused on the election period, with the time before the election being more active in terms of political content than the period after. We identified 17 new troll accounts by looking at the 75 accounts that were producing the largest amounts of political content. We did not find a disproportionate distribution of those accounts in the topics with a larger amount of its content explained by those 75 users. We then clustered accounts based on their average topical distribution. We found that, within malicious accounts excluding suspended accounts, most were very similar to each other and disparate from regular accounts. Overall, we have found this analysis to produce meaningful results that reflect our expectations in terms of strongest political activity surrounding the election day, and also in terms of uncovered topics, as several were about national politics (10), some about international politics (8), and the remaining were either not discernable/junk (15) or miscellaneous (31).

Following our content-oriented exploration, we focused on using metadata information and exploring it to gather a better knowledge of how different account types are associated with different political ideologies. Metadata stands between semantic content and interaction. In our case, we focused on the hashtag feature of Twitter. As explained in Section 2.2 (p. 8), users can associate a set of terms to a tweet in the form

81 very interesting results in terms of an actual hashtag embedding model, as well as using that model to analyze how our previously labeled account types relate to the main Portuguese political parties. This affinity is not comparable inter-parties, but rather intra-party. Hence it does not represent the political orientation but can be analyzed to measure the variations within each party. In order to build a hashtag embeddings model we gathered all co-occurring the hashtags in a tweet. To train what is essentially a word-embedding model restricted to the set of hashtags used in the captured Twittersphere, we assumed each tweet to be a different document, and programmed the algorithm to ignore the proximity (order) of the hashtags used by enlarging the window size to be larger than the tweet with the most hashtags. Note that unlike the previous section ( cf.

Section 6.3.1, p. 59) we now use the complete tweets dataset, since we expect more data to be more beneficial than time-constrained data. Hashtags were also lowercased, and those that had non-ASCII characters – mostly related to non-Latin alphabet text – were excluded as they led to unexpected errors when training our models. The final dataset included 21,602,861 total hashtags, 1,610,106 of which are unique (7.45%) from a total of 6,861,372 tweets – after excluding tweets containing non-ASCII hashtags. The chosen library was Gensim’s Word2Vec . Table 6.5 contains the top 10 hashtags in our dataset, it is relevant to observe that many have to do with Covid-19 but also that the th top hashtag ( st nd rd th th th th th th th https://radimrehurek.com/gensim/models/word2vec.html Exploration

These ad-hoc tests were a first attempt at testing our results because we had previously tried smaller samples of our dataset, for instance, including only tweets marked as being written in Portuguese – .3 High-level Exploration After tuning our model, we intended to use it to label each account’s political affinity by calculating the average similarity between each Portuguese political party’s top hashtags and the ones used by the account. However, we did not find a way of ensuring that the chosen hashtags were equally representative of each party. Although we could not achieve this initial goal without first solving the above problem, we still studied this similarity between accounts and political parties for the sole purpose of comparing how the different account types we had previously labeled ( cf.

Section 6.2, p. 52) behave within each political party. To achieve that, we manually selected the top six hashtags representative of each political party. This task was performed by looking at their most used hashtags and excluding the ones that were not specific to the party like th legislative elections. These parties are listed below, approximately sorted downwards from left to right ideology: pcp Partido Comunista Português; verdes

Partido os Verdes; be Bloco de Esquerda; livre

Partido Livre; ps Partido Socialista ( most voted in the elections ); psd Partido Social Democrata; pan

Partido das Pessoas, dos Animais e da Natureza; il Iniciativa Liberal; cds

Partido do Centro Democrático Social – Partido Popular; chega

Partido Chega. After plotting the average account similarity distribution by political party and by account type, in Figure 6.26 (p. 78), we can observe that there is an interesting phenomenon – the account types peer , fakenews , and lda are distinctly separated from the regular account type (and also the suspended account type). This should be read as a clear sign that these types of accounts are overall interacting much more with political content than regular and suspended accounts. Indeed the suspended accounts seem to have a similar distribution to the regular ones. In most political parties, we see that the median of suspended is below regular accounts, except for pan , il , and chega . Also, the fakenews accounts, although significantly skewed upwards comparing to regular accounts and closer to peer and lda , still have a distribution consistently under those two last types. Table 6.6: Portuguese political parties sorted by decreasing mean delta for each account type Account Type Party Delta Account Type Party Delta suspended chega 0.004 peer chega 0.21 suspended il -0.000 peer cds 0.20 suspended pcp -0.011 peer ps 0.20 suspended pan -0.012 peer pcp 0.19 suspended ps -0.013 peer psd 0.18 suspended be -0.014 peer be 0.18 suspended livre -0.018 peer il 0.18 suspended verdes -0.019 peer pan 0.18 suspended cds -0.027 peer livre 0.17 suspended psd -0.031 peer verdes 0.17 Account Type Party Delta Account Type Party Delta fakenews chega 0.19 lda chega 0.22 fakenews ps 0.17 lda ps 0.21 fakenews cds 0.17 lda cds 0.21 fakenews pcp 0.17 lda be 0.20 fakenews be 0.16 lda pan 0.19 fakenews pan 0.16 lda pcp 0.19 fakenews psd 0.16 lda livre 0.19 fakenews livre 0.16 lda psd 0.18 fakenews verdes 0.16 lda il 0.18 fakenews il 0.15 lda verdes 0.17 These sub-tables reveal a few things. First, the suspended account type does not differ much from the regular account types, revealing once again ( cf. Section 6.3.1.6, p. 69 ) that the Twitter’s suspensions are not representative of the types of accounts that are considered outright malicious, such as the trolls in peer and lda , neither of the less explicitly malicious fakenews account type. Second, we see that the political party chega has a unanimous top 1 rank in the average delta difference to the regular accounts for every account type, with significant differences from the top 2 party for every account type. This observation is curious as this party is known especially for its populist views which, in theory, are expected to drive more attention and separation from the norm. Excluding the small differences observed in suspended account types, we also see that the top 3 delta values belong, besides chega , to ps and cds . chega is a far-right party, cds is a democratic right party, and ps is a socialist party and actually the party in power. These results could be questioned if we assume that the hashtag choice process, although performed impartially and similarly for all parties, actually led to hashtags that better capture the difference between regular Exploration and malicious account behavior. However, should that be the case, it means that the hashtags are themselves relatively the more “malicious” as the delta values we observe increase, which would lead to the same type of conclusions, simply under distinct assumptions.

In this section, we created an embeddings model of Twitter hashtags for the entirety of our dataset. The final model was tuned against a total of 741 capital-country analogies leading to the best precision@1 of 73.4%. This model was initially intended to m easure users’ political orientation based on the hashtags they used. This goal was not possible since the choice of hashtags representing each

87 party needed a level of equality that could not be measured or guessed. However, we manually gathered the six hashtags found to best represent each party and check how, for each party, the different types of accounts were distributed. We observed that suspended accounts are very similar to regular accounts for every party. The remaining malicious types of accounts are, like in Section 6.3.1 (p. 59), both very distinct from the other types and very similar in terms of proximity to political hashtags. For each of these types of accounts, we also observed that three parties – chega , ps , cds – showed the greatest distinction from regular users, with the populist farright party chega displaying a noticeably larger distance at the top of all differences. The fact that we were able to validate our model against capital-country pairs with a high precision@1, as well as the fact that the most common hashtags show the data collection process yielded data relevant to the Portuguese context gives some reassurance as to the meaningfulness of the embeddings model we built as well as the results stemming from it. This final exploration effort complements our analysis of content ( cf.

Section 6.3.1, p. 59) and metadata ( cf.

Section 6.3.2, p. 74) by introducing an analysis on structure information – in an attempt to capture the interactions between different accounts and see how these form interaction groups or communities. Due to the limitations explained in Section 5.2 (p. 29), our dataset does not include the follower/followee relationship for the collected accounts. Although that could have been a good starting point, we believe that it is not the best alternative when the goal is to map the interaction between accounts, mainly because a follow relationship is, first of all, unidirectional – leading to a small number of accounts having a large number of followers, thus to an unbalanced network – and, second of all, this relationship does not necessarily map the strength of a relationship since there is no weight describing how strong the bond is. To overcome these limitations we first built a dataset including all the retweet , reply , quote , and mention-by interactions. This choice relates to the belief that the amount of information used would make up for the discrepancies that some of these relationships – retweet , reply , and quote – share with the follow relationship, especially the unilateral direction and consequent unbalanced network. Using this dataset we isolated all the pairs of accounts A , B if: • A retweets a tweet from B ; • A replies to a tweet from B ; • A quotes a tweet from B (a retweet with a comment); • A mentions B in one of their tweets. In line with the work done in Section 6.3.2 (p. 74), we embedded these relationships with a Word2vec approach, considering each document as a single A , B pair. The result yielded a vector space that had some very dense regions. In practice, this means that when we try to find the most similar accounts to a given query account, the result consisted of a large number of accounts that had a cosine similarity of around 0.999 – a clear indicator that the embedding process was executed on very noisy data. Indeed the manual observation for some accounts known to the first author made no perceptible sense. We believe this was due to our embeddings model being fed directed relationships but being unable to look at them in this way, for one; and also because the data was noisy – for instance, many accounts are expected to retweet, reply or quote tweets from famous people or organizations. Taking a step back, we realized that in order to capture accounts that formed interaction communities, our dataset should not depend on inherently unbalanced information. The solution lay in using the mention phenomenon. From intuition, mentions only occur when someone believes that the mentioned accounts are interested in interacting with a given tweet, as it represents an implicit request for interaction. We assume, from experience, that mentions are not common when the mentioned accounts are not expected to look at the tweet and interact with it, with a similar probability of doing so. Hence this phenomenon is expected to diminish the unbalanced relationships from above. So, we built a dataset for the Word2vec model by using tweets as documents with words corresponding to the accounts mentioned in a given tweet. Not using the mentioning account, reduces the risk of our assumption from above, if wrong, affecting our results. The final tweak introduced was that of only considering tweets that include at least two mentions, since this then allows us to capture accounts that, in the eye of the account mentioning the others, are somehow related, and are related enough to receive a simultaneous mention. Although these are mostly intuition-based reasons, we do believe that they are significant to avoid including the amount of noise the previous dataset used, and to capture an implicit network of where accounts with similar interests and interactions are highly connected. In the end, this new dataset included a total of 64,215,170 mentions for a total of 3,901,678 unique accounts (6.1%). Indeed, we got better results when using this dataset. This notion stems from manual validation since, unlike in Section 6.3.2 (p. 74), we cannot easily create a set of analogies between accounts to validate our model. Because of this, the validation step for this model was based on manually observing the most similar accounts to a set of accounts known to the first author. The training hyperparameters used are alpha=0.025, size=128, negative=5 . Additionally, we forced to model to ignore accounts that had less than 25 occurrences to improve the results. From 82 Exploration

Table 6.7: Number of accounts per account type – separating regular from malicious account types – for mentions dataset after removing low occurrence accounts Account Type a.k.a. non-regular) 6,635 2.44

89 experience, this did not significantly reduce the number of labeled accounts captured by the model. Lastly, this model did not suffer from the density problem observed in the initial approach. The key idea here is that since we cannot perform a very good validation at this stage, we will have to use this model as-is and check if the outcome of that usage results in meaningful results – which it did, and that is why we did not need to go back and try other parameters. Table 6.7 contains the account occurrences by account type embedded in our final embeddings model. Figure 6.27 (p. 83) contains a print-screen of a Uniform Manifold Approximation and Projection (UMAP) visualization of the embeddings by using TensorFlow’s embedding projector . First impressions indicate that the embeddings do contain different groups of accounts – based on the embedded co-occurring mention relationship. https://projector.tensorflow.org/

85 Indeed, Table 6.8 further shows that cluster 60 is the one where the greatest percentage of these three account types are contained. Namely, we see that peer accounts have 63.30% of its total size in cluster 60, fakenews accounts have 32.16% of its total size in cluster 60, and lda accounts have 38.46% of its total size in cluster 60. This would not be very meaningful if the relative cluster size was within the same range of values, but actually the “Delta” values are 62.58%, 31.14%, and 37.45%, respectively.

This means that they are disproportionately distributed in cluster 60. We can also claim the exact same thing about clusters 19, and 9. These are the top 3 clusters which contain a cumulative value of 94.74%, 71.37%, and 76.92% of all the accounts in the three types specified before, respectively peer , fakenews , and lda , showing disproportionate distributions that are not as large but equally significant. Besides this observation that these three different types of malicious accounts are disproportionally distributed and clustered together in terms of co-occurring mention interactions, we see that the suspended accounts seem to have a distribution that is not as extremely different from that of regular users but does however show few clusters with a disproportionate presence, more noticeably in cluster 0, with a “Delta” difference of 10.96%. Table 6.8: Metrics for the top 3 clusters with the biggest amount (count) of each account type – mentions embeddings Cluster Count Count (%) Cum. (%) % Delta Delta Index Type Biggest regular 1 st nd

44 11 39981 20129 15.09 7.60 15.09 22.69 0.02 0.16 17 th rd rd

19 19257 7.27 29.96 0.09 5 th suspended 1 st nd

44 0 881 854 17.32 16.78 17.32 34.10 2.25 10.96 6 th st rd

28 356 7.00 41.10 0.24 45 th peer 1 st nd

60 19 145 62 63.60 27.19 63.60 90.79 62.58 19.83 1 st nd rd

9 9 3.95 94.74 3.40 10 th fakenews 1 st nd

60 19 420 409 32.16 31.32 32.16 63.48 31.14 23.96 1 st nd rd

9 103 7.89 71.37 7.34 4 th lda 1 st nd

60 19 5 3 38.46 23.08 38.46 61.54 37.45 15.72 1 st nd rd

9 2 15.38 76.92 14.84 4 th What is most curious in these results – that we believe is a good indication that the embeddings model is working – is that it confirms the results obtained in Section 6.3.1.6 (p. 69): Exploration • Suspension patterns differ from the trolling behavior identified in peer and lda account types, but also from the fake news posting behavior identified in the fakenews account type; • There is an obvious similarity between fake news posting accounts and accounts that display trolling behavior; • Besides content, structure also reveals strong similarities between peer , fakenews , and lda account types that is, at the same time, quite distinct from regular account activity. In this section, we tested several approaches at building an embeddings model that would map the interaction structure found in our Twitter dataset. We found that mapping the co-occurrence of hashtags led to a better model, although this was only subject to manual validation since an analogy validation was not suitable. With this dataset we were able to embed the relationships between accounts in terms of how they are mentioned in the same parts of the Twittersphere. Even with our validation limitations, we found that clustering the accounts represented in this embeddings model to yield very interesting results: although not coming from a prior solid model validation, they are in agreement with the results of Section 6.3.1 (p. 59) and Section 6.3.1 (p. 59) increasing our belief in their meaningfulness by propagation.

In this section, we forward a few potential validity threats that are inherently present in our analysis. These have to do with both the collection process as well as the exploration. However, they belong in the exploration section since that is where they can be seen as threats to the outcome. In terms of internal validity threats, that is, endogenous conditions to the way we collected and explored data, we highlight: • Although we used the same dataset ( cf.

Section 6.1, p. 47) to perform the three independent exploration efforts described in Section 6.3.1 (p. 59), Section 6.3.2 (p. 74), and Section 6.3.3 (p. 80), we did not use the exact same parts of that dataset. For instance, in Section 6.3.1 (p. 59) we fixed a time window to analyze, whereas in the remaining two sections, we used the entire timeline, restricted according to the requirements of that section. This choice could be a problem since we are comparing the results of those different sections. However, these comparisons focus on results per the account types defined in Section 6.2 (p. 52) and complemented in Section 6.3.1.5 (p. 66) and, for these account types, there is a majority of overlap in the different explorations. Additionally, we only included larger or smaller time windows when the methodology under use is expected to benefit from that action. Using more data for embeddings typically leads to better results and, since we did not use time as a variable that could not be a threat. Nevertheless, this comparison between results on accounts that is not completely overlapping can be seen as a validation threat;

87 4 • When selecting suspended accounts in Section 6.2.1 (p. 52) we used the most recent data from Twitter. However, it is possible that in a few months, with the benefit of hindsight, there might be more suspended accounts that, at this point, have not yet been included in the set. This situation, although not under our control, could have some impact on the results if the study was repeated at a later time; • The manual labeling process performed in Section 6.3.1 (p. 59) depended on being able to infer topics from a set of relevant words. Again, this was performed by the first author in the quality of an expert. This expertise derives from being a Portuguese citizen aware of a significant amount of both internal and external events, topics, personalities, and companies. However, there is always a margin for error as this effort was not done by independent expert subjects; • Most of the conclusions drawn from data exploration are somehow related to political topics ( cf.

Section 6.3.1, p. 59), and even political parties and ideologies ( cf.

Section 6.3.2, p. 74). These were mostly drawn by the first author of this work who has his own political views – as is common in such scenarios and analysis. Even by making every effort to avoid political biases, we believe this aspect is worthy of mention; • When identifying troll activity and malicious tweets, in Section 6.3.1.5 (p. 66), we defined a set of rules for considering a tweet as troll activity, and also a minimum of two malicious tweets in order to consider an account as a troll. In a transparency effort, we also included the identified tweets (sometimes more than two when doubt could arise), in Appendix F (p. 117). Even so, this analysis could fall under the same problem as the previous point: an implicit and unnoticed political bias; • Once again, the first-author was the only subject responsible for classifying the content of the newly discovered fake news websites ( cf.

Section 6.2.3, p. 54), which is based on his knowledge and ability to fact check news. • Finally, we need to highlight that our data collection process is very byzantine by design. This collection approach means it can collect relevant data without specifically being told to collect it but also that it can escape the design constraints of focusing on a given Twittersphere. We tried to measure this throughout the collection and analysis process and also reduce its impact by either excluding data ( cf.

Section 60, p. 44) or using approaches that are not expected to be affected by this diversity like embeddings ( cf.

Section 6.3.2, p. 74). In any case, this is a risk that we took, and one that we stand by. From an external validity point of view – relating to the generalizability of the results – a couple of other threats arise, namely: • The conclusions we drew stem from looking at data that is particular to a time window and a Portuguese context – we recognize that, although an effort exists to introduce factors and variables that could destabilize the outcomes – there is no guarantee that our conclusions will not age; • Even assuming the previous point is a non-issue, a bigger one came about during the

Exploration development of this work: Covid-19. Although we tried to measure the impact of this phenomenon on our data, namely in Section 6.1 (p. 47) and also in Table 6.5 (p. 75), it is hard to predict what the lasting impacts of the pandemic will be on the results of our work. 6

Chapter 7

Conclusions “Democracy could never be imposed at the point of a gun, but perhaps it could be sown by the spread of silicon and fiber.”

Edward Snowden in Permanent Record

The current thesis touches a multitude of topics and approaches, in this section we go over each of these. We shall present our conclusions, revisit our hypothesis, enumerate contributions and future work ideas that we believe will make sense given all that has been learned. Initially, we identified the inherent problem of the massive adoption of Social Networks (SNs) as sources of public information. Online Social Networks (OSNs) being at the center of a conflict between opinion manipulation and democratic values. The risk of compromised freedom of thought exists, it is recognized by the public, by governments, and by the very companies that built these OSNs ( cf.

Chapter 2, p. 5). Following the contextualization of our work, we investigated what the state-of-the-art approaches focus on. This was achieved via a Systematic Literature Review (SLR) process that stands by itself as a structured survey. Under the flagship of understanding current approaches on malicious political content detection on Twitter, we learned the answers to three main questions. We learned from SRQ1 that current studies focus on extracting account, content, network, and activity data. We learned from SRQ2 that data is used as input for Machine Learning (ML) classifiers, for data representation approaches (like Latent Dirichlet Allocation (LDA) and embeddings), for community detection, and for other tools and intermediary processing stages. We learned from SRQ3 that these analyses tend to focus on performing ad-hoc data exploration, developing bot classifiers, or better mapping the taxonomy of the malicious activity in OSNs. The last lesson from this effort was understanding there are several systematic dangerous assumptions in the literature. One is assuming malicious behavior is not adaptable and to classification methods that rely on easily manipulated features. The outcome is that these efforts have no real-world application and thus fail to meet their proposed motivations. Another is assuming that really dangerous malicious behavior can be seen with a magnifier at the account-level ( cf.

Chapter 3, p. 15). 89

Conclusions

We then proposed our hypothesis to defy these assumptions in an attempt to understand if high-level approaches can be used as an alternative, not forgetting that focus should be given to features that are not as easy to manipulate when manipulation is what we want to avoid:

Do different high-level approaches to Twitter data analysis, when applied to distinct types of malicious activity, a) lead to meaningful results and b) agree among themselves?

To answer it, we analyzed Twitter data within the Portuguese Twittersphere. To analyze this data, we had to collect it first. To collect it, we designed a new collection method – Twitter Watch. Twitter Watch stands as a configurable, dynamic, easily-deployable, parallel and robust framework for data collection under a set of conditions. The most relevant conditions are its usage of a set of seed accounts and its mechanism to restrict content by languages. The resulting dataset was massive and, except for some limitations – like other Portuguese- speaking countries’ content and indeed the Covid -19 pandemic impact – , adequate for our study. Following the release of this work, both Twitter Watch and the dehydrated dataset version will be open-sourced. Our dataset includes suspended account information, as per Twitter Watch’s design. It was enriched by leveraging known fake news websites to uncover even more, and then all of those were used to identify accounts who shared them. It was also enriched through a collaboration with a peer working on a parallel project and through a later analysis stemming from an LDA model trained on this dataset. The union of these four labeled account types is considered the malicious set of accounts. This enriched dataset was then described and explored in three main approaches: content, metadata, and structure ( a.k.a. network). The content-oriented approach revolved around using LDA topic modeling and subsequent analysis and exploration to understand how politics work in the Portuguese Twitter. We analyzed the 75 accounts that explained the greatest cumulative portion of each political topic and uncovered 17 new trolls. We also grouped accounts by topical similarity through clustering. These clusters revealed that all the labeled malicious types, except the suspended accounts, are disproportionally distributed when compared to regular accounts, and that this disparity is extremely similar regardless of which of these three account types we are looking at – fake news posters, peer-shared trolls, LDA found trolls. The metadata-oriented effort was initially envisioned as a means of finding political allegiance between the accounts in our dataset and the Portuguese political parties. We created an embeddings model of hashtags. Our initial goal was not feasible without solving an issue we failed to solve – how to find the hashtags that describe each political party equivalently – perhaps no perfect solution exists. However, we did manage to prove that building and tuning a hashtag embeddings model is possible, and that contribution may sparkle further research. Even with the apparent failure, we were able to perform an analysis of each party individually to find out how differently do malicious

91 and regular users relate to it. We also found that the malicious account types, except the suspended ones, are much more politically separated than regular accounts. 8 The structure-oriented exploration delved into embedding interaction data in order to identify groups or communities within the same “sub - twitterspheres”. A few datasets were built with retweet, reply, quote, and mention interactions that failed to produce any decent embeddings model. The tables were turned when considering only co-occurring mentions. This new dataset revealed that mapping mentions could produce very appealing results in terms of accounts with similar interaction behavior. This was especially observable when we clustered accounts by their embedding values and reached the exact same conclusions as in the totally independent content-oriented approach. Namely, that within malicious accounts, suspended accounts are different, and the remaining three types are both very distinct from regular accounts as they are significantly similar between each other. From all of these, we answer our hypothesis, as we have now seen that, for our preferential types of data analysis, these types of approaches yielded meaningful models which revealed a large agreement in terms of results, especially in understanding that Twitter’s suspension patterns are very distinct from the characterization of the remaining types of malicious accounts, whether in terms of content, metadata, or structure/interaction. These different exploration endeavors focused on avoiding the pitfalls of the approaches we had previously identified in the SLR process. If someone intends to manipulate political opinions, either through individual behavior or orchestrated campaigns, there are things that are mainly hard to escape: the nature of what you want to manipulate – its content; and the targets of that manipulation – the interaction communities. All in all, we can highlight the following contributions:

Systematic Literature Review

Conducted a SLR process to understand the current state of research on the types of data, types of processing, and end-goals of malicious political activity detection;

Twitter Watch

Designed and implemented a configurable, dynamic, easily-deployable, and robust framework for Twitter data collection. This tool was used to collect data of the Portuguese Twittersphere from August 1 st th Fake News

Using a simple technique that uses a seed of 21 known fake news websites, we were able to isolate all the accounts who shared them on Twitter (715 unique). With the top 50 accounts that share the most significant amount of fake news, we isolated all their 200 most shared

Conclusions websites and found 18 new fake news websites;

Meaningfulness

We showed that using the collected dataset led us to build meaningful models for different types of data used. The LDA model revealed how the election period of October 6 th Agreement

The three exploration approaches pursued – content, metadata, and structure – showed a significant agreement in terms of how the distinct types of malicious activity – suspended, fake news posters, peer shared trolls, trolls found with our LDA model – are related to each other and regular accounts. Emphasis on the suspended accounts that have a strong resemblance to regular accounts and the remaining malicious types having a high resemblance among themselves and very little resemblance with the suspended and regular accounts; Hypothesis

As briefly explained in the above two points, and more detailed in Section 7.1 (p. 90), we answered our hypothesis after having established our preliminary study ( cf.

Section 4.1.1, p. 24) based on understanding the types of information we can get from each of the three exploration approaches undertaken.

Our results are interesting but are only a first step into understanding what kind of high-level approaches work to analyze and detect malicious political content on Twitter. We believe many things can be considered for future work, namely:

Extrapolation in time

Pursue further research efforts that focus on validating our results through time, namely in future elections in Portugal, even though it is impossible to predict the existence of everlasting effects of the Covid-19 pandemic.

Extrapolation in context

Ensure that these results that were mostly focused on the Portuguese Twittersphere can be observed in other contexts. However, we expect countries where social media manipulation is more prolific to reveal different types of outcomes. For instance, the accounts most responsible for producing content on political topics might be a very sound way of finding highly widespread attempts at political manipulation. Twitter Watch

In order to scale in a way that can be both more manageable in terms of computing requirements and the amount of data collected, we believe Twitter Watch would benefit from having a way to grow horizontally through the use of a distributed database. Additionally, we envision Twitter Watch as a platform not only for data collection but also for monitoring a given Twittersphere by being able to execute previously built models in real-time. More, we believe 00 that expanding this framework to other social networks like Facebook could potentially result in the capitalization of the abstraction level we enforced on the tool, eventually leading to broader adoption;

Open-source

A final note on the future of this work pertains to the ability to make Twitter Watch, in the abovementioned capacity, open to any citizen, researcher, journalist, organization, or government as a means of political transparency and manipulation prevention. 94

Conclusions

Epilogue

A word from the first author, I saw the worst and the best of this SN, it was enlightening. I did not find any orchestrated manipulation effort, or maybe I did, and did not notice it. Perhaps that was actually the most startling observation stemming from this thesis. Let me explain. Orchestration was absent. Widespread extreme opinions were not. Racism, hate, and disinformation are prolific. I saw these proliferate in what can only be described as gullible users . Do take a moment to notice how we have extensively tried to use the term “account”, and not user, throughout this document. This is because users are expected to be made of flesh and bones, to be humans, people. Every person’s opinions and ideals influence the lives of many others, but the corrupted goals of a few are getting more attention, by marching on the back of a populism beast that can haze people into believing extreme nationalist messages the likes of which have proved to be catastrophic in the past. Twitter is the medium, we write the message. I believe technological efforts, like this one, are essential, but not enough to prevent intolerance from spreading. We need education, facts, and open-mindedness. Online Social Networks are a recent phenomenon. Most people did not grow up being taught about the dangers they pose. Not just to our privacy, but to our view on the world, to our beliefs, ideologies and, ultimately, our actions. This needs to change. The best way to fight the disinformation virus is not by using homeopathic machine learning, it is by inoculating people against it. Until then, I know efforts like this will not stop, nor should they, and neither should we tolerate the spread of false information and its proven power to lead people into intolerance. A simple action like reporting malicious accounts on Twitter can have a massive impact on how we help each other. We should tolerate everyone and everything, the only thing we cannot tolerate as accounts, as users, and as Humans is intolerance [51]. 95 96

Conclusions

References [1]

Amelia Acker. Data craft: The manipulation of social media metadata.

Data & Society Research Institute , 2018.

Cited on page 13. [2]

Hunt Allcott and Matthew Gentzkow. Social media and fake news in the 2016 election.

Journal of economic perspectives , 31(2):211 – Cited on page 11. [3]

David Alvarez-Melis and Martin Saveski. Topic modeling in Twitter: Aggregating tweets by conversations.

Proceedings of the 10th International Conference on Web and Social Media, ICWSM 2016 , pages 519 – Cited on page 59. [4]

Panagiotis Andriotis and Atsuhiro Takasu. Emotional bots: Content-based spammer detection on social media. , pages 1 –

8, 2019.

Cited on pages 19, 20, 21, and 59. [5]

David Arthur and Sergei Vassilvitskii. K-means++: The advantages of careful seeding.

Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms , 07-09-Janu:1027 – Cited on page 70. [6]

Adam Badawy, Aseel Addawood, Kristina Lerman, and Emilio Ferrara. Characterizing the 2016 Russian IRA influence campaign.

Social Network Analysis and Mining , 9(1):1 –

11, 2019.

Cited on page 2. [7]

Adam Badawy, Emilio Ferrara, and Kristina Lerman. Analyzing the digital traces of political manipulation: The 2016 Russian interference Twitter campaign.

Proceedings of the 2018

IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2018 , pages 258 – Cited on pages 19 and 21. [8]

Glenn Beck.

The Overton Window . Simon and Schuster, 2010.

Cited on page 13. [9]

Tim Berners-Lee, Robert Cailliau, Jean Francois Groff, and Bernd Pollermann. World-wide web: The information universe.

Internet Research , 2(1):52 –

58, 1992.

Cited on pages 1 and 5. [10]

David M. Beskow and Kathleen M. Carley. Bot conversations are different: Leveraging network metrics for bot detection in Twitter.

Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2018 , pages 825 – Cited on pages 2, 11, 19, 20, and 21. [11]

David M Blei, Andrew Y Ng, and Michael I Jordan. Latent dirichlet allocation.

Journal of machine Learning research , 3(Jan):993 – Cited on page 25. [12]

David M Brown, Adriana Soto-Corominas, Juan Luis Surez, and Javier de la Rosa.

The

SAGE Handbook of Social Media Research Methods . SAGE Publications Ltd, 55 City Road, London, 2016.

Cited on page 9.

97 04

REFERENCES [13]

Pui-

Yee Chiu, Christy M K Cheung, and Matthew K O Lee. Online Social Networks: Why Do “We”

Use Facebook? In Miltiadis D Lytras, John M Carroll, Ernesto Damiani, Robert D Tennyson, David Avison, Gottfried Vossen, and Patricia Ordonez De Pablos, editors,

The Open Knowlege Society. A Computer Science and Information Systems Manifesto , pages 67 –

74, Berlin, Heidelberg, 2008. Springer Berlin Heidelberg.

Cited on page 6. [14]

Soon Ae Chun, Richard Holowczak, Kannan Neten Dharan, Ruoyu Wang, Soumaydeep Basu, and James Geller. Detecting political bias trolls in Twitter data.

WEBIST 2019 - Proceedings of the 15th International Conference on Web Information Systems and Technologies , pages 334 – Cited on pages 19, 20, and 27. [15]

Marco Ciotti, Silvia Angeletti, Marilena Minieri, Marta Giovannetti, Domenico Benvenuto, Stefano Pascarella, Caterina Sagnelli, Martina Bianchi, Sergio Bernardini, and Massimo Ciccozzi. COVID-19 Outbreak: An Overview.

Chemotherapy , 2020.

Cited on pages 47 and 48. [16]

Carmela Comito, Agostino Forestiero, and Clara Pizzuti. Word embedding based clustering to detect topics in social media.

Proceedings - 2019 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2019 , pages 192 – Cited on page 25. [17]

Kheir Eddine Daouadi, Rim Zghal Rebaï, and Ikram Amous. Organization, bot, or human: Towards an efficient twitter user classification.

Computacion y Sistemas , 23(2):273 – Cited on pages 19, 20, and 21. [18]

Isaac David, Oscar S. Siordia, and Daniela Moctezuma. Features combination for the detection of malicious Twitter accounts. , 2017.

Cited on pages 19 and 20. [19]

Jeffrey Dean and Sanjay Ghemawat. MapReduce: Simplified data processing on large clusters.

Communications of the ACM , 51(1):107 – Cited on page 41. [20]

J. Donath. Identity and Deception in the Virtual Community. In P. Kollock and M. Smith (Eds). Communities in Cyberspace.

Berkeley, CA: University of California Press. , 1997.

Cited on page 11. [21]

Nir Eyal and Ryan Hoover.

Hooked : how to build habit-forming products . New York, New York : Portfolio/Penguin, 2014.

Cited on page 6. [22]

Don Fallis. A Conceptual Analysis of Disinformation. iConference , pages 30 –

31, 2009.

Cited on page 10. [23]

Hossein Fani, Eric Jiang, Ebrahim Bagheri, Feras Al-Obeidat, Weichang Du, and Mehdi Kargar. User community detection via embedding of social network structure and temporal content.

Information Processing and Management , 57(2):102056, 2019.

Cited on pages 19, 20, 22, 25, 27, and 59. [24]

Paolo Fornacciari, Monica Mordonini, Agostino Poggi, Laura Sani, and Michele Tomaiuolo. A holistic system for troll detection on Twitter.

Computers in Human Behavior , 89:258 – Cited on pages 19 and 20. [25]

Guillaume Gadek. From community detection to topical, interactive group detection in Online Social Networks.

Proceedings - 2019 IEEE/WIC/ACM International Conference on Web Intelligence Workshops, WI 2019 Companion , pages 176 – Cited on pages 19, 20, 22, and 25.

REFERENCES

99 [26]

Zafar Gilani, Reza Farahbakhsh, Gareth Tyson, and Jon Crowcroft. A large-scale behavioural analysis of bots and humans on twitter.

ACM Transactions on the Web , 13(1):1 –

23, 2019.

Cited on pages 11, 19, and 21. [27]

Matthew Goerzen and Jeanna Matthews. Black hat trolling, white hat trolling, and hacking the attention landscape.

The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019 , 2:523 – Cited on pages 7, 11, 12, 13, and 68. [28]

Christian Grimme, Dennis Assenmacher, and Lena Clever. Changing Perspectives: Is It Sufficient to Detect Social Bots? In

International Conference on Social Computing and Social

Media , volume 10913, pages 445 – Cited on pages 2, 11, and 21. [29]

Tahora H. Nazer, Matthew Davis, Mansooreh Karami, Leman Akoglu, David Koelle, and Huan Liu. Bot detection: Will focusing on recall cause overall performance deterioration?

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 11549 LNCS:39 –

49, 2019.

Cited on pages 19 and 26. [30]

Claire Hardaker. Trolling in asynchronous computer-mediated communication: From user discussions to academic definitions.

Journal of Politeness Research , 6(2):215 – Cited on page 11. [31]

Julia Heidemann, Mathias Klier, and Florian Probst. Online social networks: A survey of a global phenomenon.

Computer Networks , 56(18):3866 – Cited on pages 1, 5, and 6. [32]

J E Hirsch. An index to quantify an individual’s scientific research output.

Proceedings of the National Academy of Sciences , 102(46):16569 – Cited on page 18. [33]

Matthew D. Hoffman, David M. Blei, and Francis Bach. Online learning for Latent Dirichlet Allocation.

Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010, NIPS 2010 , pages 1 –

9, 2010.

Cited on page 60. [34]

Isa Inuwa-Dutse, Mark Liptrott, and Ioannis Korkontzelos. Detection of spam-posting accounts on Twitter.

Neurocomputing , 315:496 – Cited on pages 19, 20, and 21. [35]

Di Jin, Liang Yang, Jiantao Huang, Dongxiao He, Yuxiao Huang, Pengfei Jiao, and Françoise Fogelman-Souliè. A novel generative topic embedding model by introducing network communities.

The Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019 , pages 2886 – Cited on page 25. [36]

N A Karlova and K E Fisher. A social diffusion model of misinformation and disinformation for understanding human information behaviour.

Information Research , 18, 2013.

Cited on page 10. REFERENCES [37]

Natascha Karlova and Jin Lee. Notes from the underground city of disinformation: A conceptual investigation.

Proceedings of the American Society for Information Science and Technology , 48:1 –

9, 2011.

Cited on page 10. [38]

James W. Kershner.

The Elements of News Writing . Pearson/Allyn and Bacon, boston, ma edition, 2005.

Cited on page 11. [39]

Sneha Kudugunta and Emilio Ferrara. Deep neural networks for bot detection.

Information Sciences , 467:312 – Cited on pages 11, 19, 20, and 21. [40]

Haewoon Kwak, Changhyun Lee, Hosung Park, and Sue Moon. What is Twitter, a social network or a news media?

Proceedings of the 19th International Conference on World Wide Web, WWW ’10 , pages 591 – Cited on pages 1, 6, and 8. [41]

Tiago Lacerda. TrollBus, an Empirical Study of Features for Troll Detection. Technical report, Faculdade de Engenharia da Universidade do Porto, 2020.

Cited on page 54. [42]

Jessica Li and H.R. Rao. Twitter as a Rapid Response News Service: An Exploration in the Context of the 2008 China Earthquake.

The Electronic Journal of Information Systems in Developing Countries , 42(1):1 –

22, jul 2010.

Cited on page 1. [43]

Robert M Losee. A Discipline Independent Definition of Information.

J. Am. Soc. Inf. Sci. , 48(3):254 – Cited on page 10. [44]

Jalal Mahmud, Jeffrey Nichols, and Clemens Drews. Where is this tweet from? Inferring home locations of Twitter users.

ICWSM 2012 - Proceedings of the 6th International AAAI Conference on Weblogs and Social Media , pages 511 – Cited on page 45. [45]

Rishabh Mehrotra, Scott Sanner, Wray Buntine, and Lexing Xie. Improving LDA topic models for microblogs via tweet pooling and automatic labeling.

SIGIR 2013 - Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval , pages 889 – Cited on page 59. [46]

Fred Morstatter, Liang Wu, Tahora H. Nazer, Kathleen M. Carley, and Huan Liu. A new approach to bot detection: Striking the balance between precision and recall.

Proceedings of the 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2016 , pages 533 – Cited on pages 19, 20, and 26. [47]

Katarzyna Musial and Przemysław Kazienko. Social networks on the Internet.

World Wide Web , 16, 2012.

Cited on pages 1 and 7. [48]

Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford InfoLab, 1999.

Cited on page 18. [49]

Pujan Paudel, Trung T. Nguyen, Amartya Hatua, and Andrew H. Sung. How the Tables Have Turned: Studying the New Wave of Social Bots on Twitter Using Complex Network Analysis Techniques. , pages 501 – Cited on page 2. [50]

Diego Perna and Andrea Tagarelli. Learning to rank social bots.

HT 2018 - Proceedings of the 29th ACM Conference on Hypertext and Social Media , pages 183 – Cited on pages 19, 20, and 21. [51]

Karl Popper.

The Open Society and Its Enemies . Routledge, 1945.

Cited on page 95. [52]

Alec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 , 2015.

Cited on page 25. [53]

Tim O Reilly and Sarah Milstein.

The Twitter Book . Media, O’Reilly, 2009.

Cited on page 8. [54]

Daniel Rofrio, Anacaren Ruiz, Erin Sosebee, Qasim Raza, Adnan Bashir, Jed Crandall, and Ramiro Sandoval. Presidential elections in Ecuador: Bot presence in twitter. , pages 218 – Cited on pages 19, 20, and 21.

REFERENCES

101 [55]

Nikolaos Sachpenderis, Alexandros Karakasidis, and Georgia Koloniari. Structure and Content based Community Detection in Evolving Social Networks.

Proceedings of the 11th

International Conference on Management of Digital EcoSystems , pages 1 –

8, 2019.

Cited on page 25. [56]

Fabian Schneider, Anja Feldmann, Balachander Krishnamurthy, and Walter Willinger. Understanding Online Social Network Usage from a Network Perspective. In

Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement , IMC ’09, pages 35–

48, New York, NY, USA, 2009. Association for Computing Machinery.

Cited on page 5. [57]

Ross Schuchard, Andrew T. Crooks, Anthony Stefanidis, and Arie Croitoru. Bot stamina: examining the influence and staying power of bots in online social networks.

Applied Network Science , 4(1), 2019.

Cited on pages 19 and 21. [58]

Elham Shaabani, Ashkan Sadeghi Mobarakeh, Hamidreza Alvari, and Paulo Shakarian. An End-to-End Framework to Identify Pathogenic Social Media Accounts on Twitter.

Proceedings - 2019 2nd International Conference on Data Intelligence and Security, ICDIS 2019 , pages 128 – Cited on pages 19, 20, and 21. [59]

C. E. Shannon. A Mathematical Theory of Communication.

Bell System Technical Journal , 27(4):623 – Cited on page 10. [60]

Felipe Bonow Soares, Raquel Recuero, and Gabriela Zago. Asymmetric polarization on twitter and the 2018 brazilian presidential elections.

ACM International Conference Proceeding Series , pages 67 –

76, 2019.

Cited on page 19. [61]

Giorgos Stamatelatos, Sotirios Gyftopoulos, George Drosatos, and Pavlos S. Efraimidis. Deriving the political affinity of twitter users from their followers.

Proceedings - 16th IEEE

International Symposium on Parallel and Distributed Processing with Applications, 17th IEEE International Conference on Ubiquitous Computing and Communications, 8th IEEE International Conference on Big Data and Cloud Computing, 11t , pages 1175 – Cited on pages 19, 20, and 25. REFERENCES [62]

Kate Starbird, Jim Maddock, Mania Orand, Peg Achterman, and Robert M Mason. Rumors, False Flags, and Digital Vigilantes: Misinformation on Twitter after the 2013 Boston Marathon Bombing.

IConference , 2014.

Cited on page 6. [63]

Edson Tandoc, Zheng Lim, and Rich Ling. Defining "Fake News": A typology of scholarly definitions.

Digital Journalism , pages 1 –

17, 2017.

Cited on page 11. [64]

Kimmo Tuominen and Reijo Savolainen. A social constructionist approach to the study of information use as discursive action. In

Proceedings of an international conference on Information seeking in context , pages 81 –

96, 1997.

Cited on page 10. [65]

Joshua Uyheng and Kathleen M. Carley. Characterizing bot networks on twitter: An empirical analysis of contentious issues in the Asia-Pacific.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , 11549 LNCS:153 – Cited on pages 19, 20, and 21. [66]

Svitlana Volkova, Kyle Shaffer, Jin Yea Jang, and Nathan Hodas. Separating facts from fiction: Linguistic models to classify suspicious and trusted news posts on twitter.

ACL 2017 - , 2:647 – Cited on pages 11, 19, and 20. [67]

Samuel C. Woolley. Automating power: Social bot interference in global politics.

First Monday , 21(4), 2016.

Cited on pages 2 and 11. [68]

Lin Yue, Weitong Chen, Xue Li, Wanli Zuo, and Minghao Yin. A survey of sentiment analysis in social media.

Knowledge and Information Systems , 60(2):617 – Cited on page 1. [69]

Savvas Zannettou, Tristan Caulfield, William Setzer, Michael Sirivianos, Gianluca Stringhini, and Jeremy Blackburn. Who let the trolls out? towards understanding state-sponsored trolls.

WebSci 2019 - Proceedings of the 11th ACM Conference on Web Science , pages 353 – Cited on pages 19, 20, 21, and 59.

Appendices

Appendix A

Example of a Twitter API User Object { "_id": NumberLong(718009445863788544), "collected_at": ISODate("2020-04-05T17:40:29.333Z"), "created_at": ISODate("2016-04-07T09:36:06.000Z"), "depth": 0, "description": "Primeiro-Ministro de Portugal e @antoniocostaps. → ֒ Sigam também o XXII Governo em @govpt.", "favourites_count": 140, "first_collected_at": ISODate("2020-04-05T17:29:47.216Z"), "followers_count": 122171, "friends_count": 308, "location": "Portugal", "name": "António Costa", "profile_banner_url": "https://pbs.twimg.com/profile_banners/ "profile_image_url": "http://pbs.twimg.com/profile_images/ "screen_name": "antoniocostapm", "statuses_count": 2926, "url": "https://t.co/m7nAuRvyly", "verified": true } Listing 3: Example of a Twitter API user object for the current Portuguese prime minister 106

Example of a Twitter API User Object

Appendix B

Fake News Websites List from Manual Investigation

Source 1 – DN.pt • https://noticiasdem3rda.com/ • • • • https://jornaldiario.net/ • http://noticiario.com/ • • • • • • • • • https://tafeio.com.pt/ Source 2 – Reddit’s r/portugal • https://verdade.com.pt/ 108 Fake News Websites List from Manual Investigation

Source 3 – JN.pt • • • • • • • • • • • http://noticias24.com.pt/ • http://noticiario.com.pt/ • http://carregabenfica.pt/ • • https://asnoticias.pt/ • https://noticias.com.pt/ • https://ligaportuguesa.pt/ • • https://sonoticias.pt/ • https://livredireto.pt/ • https://remate.pt/ • https://futebol11.com/ • https://portoatemorrer.com/ • https://info24h.pt/ Source 3 – Author’s knowledge • Appendix C

Uncovered Fake News Websites from Inspection of Known Fake News Posters

Fake News tuga.press 2. flash.pt 3. vortexmag.net 4. inimigo.publico.pt 5. tuasaude.com 6. partilhado.pt 7. soutodaboa.com 8. apost.com 9. sabiaspalavras.com 10. seuamigoguru.com 11. elucubrativo.blogspot.com 12. palavrasoltas.com 13. noticiasdevizela.pt 14. direitapolitica.com 15. lusojornal2015.blogspot.com 16. pensarcontemporaneo.com 17. postal.pt 18. magazinept.com 110 Uncovered Fake News Websites from Inspection of Known Fake News Posters

Portuguese Newspaper Covers from October 19th 2019

Appendix D

Portuguese Newspaper Covers from October 19th 2019

Figure D.1: October 19 th Portuguese Newspaper Covers from October 19th 2019

113 Figure D.2: October 19 th Portuguese Newspaper Covers from October 19th 2019

Appendix F

Evidence of Malicious Activity in Manually Uncovered Trolls (LDA)

CSV list of user_id; screen_name; tweet_id; tweet_text; malicious_reason; link_to_twitter • • • • • • • • • -Acho muito bem que em Loures é só comunas e ciganagem.”; hate speech; link • • Evidence of Malicious Activity in Manually Uncovered Trolls (LDA)

Steve Brannon, despedido por Trump, decidiu vingar-se e organizou as revoltas nos EUA. As revoltas são uma operação militar, não popular.; conspiracy seeding/disinformation; link • • • • • • • • • • • • • • Evidence of Malicious Activity in Manually Uncovered Trolls (LDA)

119 link • • • • • • • • • • • • • • • Evidence of Malicious Activity in Manually Uncovered Trolls (LDA) • • • • Appendix G

LDA clustering – All Account Types Distributed per Cluster

Appendix H

List of Capital to Country Analogy Pairs

Table H.1: Capital to country hashtag pairs capital country capital country madrid spain sofia bulgaria paris france tokyo japan ottawa canada tunis tunisia berlin germany washington unitedstates london england stockholm sweden athens greece prague czechia amsterdam holland montevideo uruguay budapest hungary bratislava slovakia vienna austria moscovo russia zagreb croatia algiers algeria bern switzerland luanda angola belgrade serbia amman jordan brussels belgium riga latvia bucharest romania vilnius lithuania cairo egypt dakar senegal dublin ireland copenhagen denmark seoul southkorea quito ecuador oslo norway lima peru kiev ukraine warsaw poland maputo mozambique - - 124

List of Capital to Country Analogy Pairs

Appendix I