Covid-19 Tweeting in English: Gender Differences
CCovid-19 Tweeting in English: Gender Differences
Mike Thelwall, School of Mathematics and Computer Science, University of Wolverhampton, UK. Email: [email protected]. Orcid: 0000-0001-6065-205X Saheeda Thelwall, Institute of Health, University of Wolverhampton, UK. Email: [email protected] Orcid: 0000-0002-0333-399X At the start of 2020, COVID-19 became the most urgent threat to global public health. Uniquely in recent times, governments have imposed partly voluntary, partly compulsory restrictions on the population to slow the spread of the virus. In this context, public attitudes and behaviors are vitally important for reducing the death rate. Analyzing tweets about the disease may therefore give insights into public reactions that may help guide public information campaigns. This article analyses 3,038,026 English tweets about COVID-19 from March 10 to 23, 2020. It focuses on one relevant aspect of public reaction: gender differences. The results show that females are more likely to tweet about the virus in the context of family, social distancing and healthcare whereas males are more likely to tweet about sports cancellations, the global spread of the virus and political reactions. Thus, women seem to be taking a disproportionate share of the responsibility for directly keeping the population safe. The detailed results may be useful to inform public information announcements and to help understand the spread of the virus. For example, failure to impose a sporting bans whilst encouraging social distancing may send mixed messages to males.
Introduction
COVID-19 is, at the time of writing, a major global threat to public health (e.g., Lipsitch, Swerdlow, & Finelli, 2020). Public actions are critically important in slowing the spread of the virus and therefore reducing the death rate due to the volume of critically ill patients needing simultaneous care, such as by running out of ventilators. Governments around the world have reacted by announcing mandatory actions, such as shutting restaurants and the normal functioning of schools, and by giving strongly recommended or mandatory advice to the public for personal hygiene and social distancing to slow the spread of the virus. The extent to which the population follows expert health advice is expected to have a substantial impact on the death rate from the virus. If social distancing is widely ignored or misunderstood, for example, then national healthcare facilities will not be able to give all critically ill patients the care that they need to survive. It is therefore vitally important to assess how the public is reacting to the crisis and one way (amongst many) of investigating this is through social media posts, including tweets (e.g., Cinelli, Quattrociocchi, Galeazzi, et al., 2020), and one important potential arena of difference (amongst many) is gender. Twitter is a natural platform for public information sharing in many countries, including all large English-speaking nations. Although less popular than Facebook, its advantage for research is that it is typically fully public and researchers can therefore access its contents. Moreover, Twitter gives free use of an Applications Programming Interface (API) for automatically harvesting recent (up to a week old) tweets matching keyword searches, making it a practical source of data about public reactions to tweets. A disadvantage is that Twitter users’ demographics do not match those of the population. In the USA, about 23% of adults use the site, behind Facebook (71%) and Instagram (38%), but ahead of WhatsApp 18%) and Reddit (13%) (Schaeffer, 2019). Moreover, older people (and more at risk from COVID-19) are be less likely to use Twitter, men are slightly more likely to use it (50% female within a 52% female population) but adopters tend to be richer and more educated in the USA (Smith & Wojcik, 2019). There are also finer-grained differences, such as political variations between users and non-users (Smith, Hughes, Remy, & Shah, 2020). Nevertheless, analyzing tweets may give some quick large-scale insights into public reactions to COVID-19. This study focuses on gender differences in reactions to COVID-19 on Twitter. Since public safety measures must be adhered to by the entire population to be maximally effective, any gender differences in responses may point to weaknesses in public communications about the seriousness of the outbreak. This information may help with the creation of new messages targeting males or females more effectively. In addition, understanding gender differences may help modelling epidemiologists to create more accurate models of the spread of the disease. The current paper therefore analyses two weeks (March 10-23, 2020) of tweeting in English about COVID-19 from the perspective of gender differences in responses. Although the virus is a global pandemic, the focus on English is for pragmatic methodology reasons and similar research in other languages is encouraged (and supported by the free software at http://mozdeh.wlv.ac.uk).
Methods
The research design was to collect English-language tweets matching a set of queries related to Covid-19 over two weeks and to identify words used more by males than females, using these to point to aspects of gender difference in tweeting about the virus. A word frequency method is useful for gender comparisons because it gives statistically significant evidence in a transparent fashion. In contrast, content analysis or thematic analysis are unlikely to discover fine-grained gender differences and cluster-based methods, such as topic modelling, can be changed by small alterations in the data, and so are not robust. Topic modelling is also not able to give as fine-grained gender difference information as word frequency comparisons. Word frequency analysis therefore fills a gap in comparison to other methods. The following queries were used to identify different common ways of referring to the disease: coronavirus; “corona virus”; COVID-19; COVID19. These were submitted to Twitter at the maximum speed allowed by the free Twitter API from 10 to 23 March 2020, obtaining 3,038,026 tweets after eliminating duplicates (including multiple retweets) and near duplicates (tweets identical apart from @usernames and league was associated with tweets discussing the full or partial closure of various sporting competitions or facilities. Gender differences in this word therefore suggest that males were more likely to tweet that league-based sport was affected by COVID-19 restrictions. The word contexts varied from obvious (e.g.,
Results
The main themes identified in the tweets are summarized below by gender. The complete list of terms and raw tweet counts associated with them are available on FigShare (https:doi.org/10.6084/m9.figshare.12026625).
Male-oriented themes
Male-authored tweets about COVID-19 were about twice as likely as females to discuss sports, typically in the context of speculation about, or announcements of, events or competitions being cancelled (Figure 1). Whilst this is relatively peripheral to the disease, males were also substantially more likely to mention, or take issue with, political figures or government, particularly within India (Figure 2). Males were also more likely to tweet about the economy (terms: economy, market; not graphed). Figure 1. Sport-related terms with gender differences in usage. Figure 2. Politics-related terms with gender differences in usage. The epidemiology of the virus (Figure 3), including its geographic spread (Figure 4), was another male topic. Both relate to sharing news about the spread and extent of the virus. Figure 3. Epidemiology-related terms with gender differences in usage. Figure 4. Geography-related terms with gender differences in usage.
Female-oriented themes
Female-oriented themes seemed to focus on the first and second lines of defense against the virus. The key theme of social distancing is moderately female-oriented (Figure 5), in the sense that females were more likely to use the ixed gender
Two broad themes were mixed gender in the sense of some terms being male-associated and others being female-associated. Males were more likely to discuss the virus as a war whereas females were more likely to mention their anxiety about its effects (Fig 9). The war metaphor is a way of generalizing the situation as well as perhaps for males glamorizing actions against it, or emphasizing the seriousness of the issue. Thus, war metaphors could be an indirect way of expressing anxiety. Figure 9. Fight or worry-related terms with gender differences in usage. There were mixed gender differences in discussions of curfews (Figure 10). Whilst males were more likely to announce the existence of a curfew, females were more likely to discuss its practical impacts. Figure 10. Curfew-related terms with gender differences in usage.
Discussion
This quick analysis of gender differences in English tweeting about COVID-19 has several limitations. In addition to the issues discussed above, another important aspect is that Twitter does not report the geographic location of the tweets and so the data has unknown origins. In particular, if some countries have an unusually high proportion of active tweeters of one gender, then this could translate into tweets about that country statistically significantly associating with that gender with the tests used above. The results are broadly consistent with previous research into gender differences in language use, including on social media, and gender differences in interests. The primary ontribution here is therefore to so show which gender differences translate to COVID-19 on Twitter, rather than finding new gender differences. The greater male interest in sport in many countries is widely known (e.g., Plaza, Boiché, Brunel, & Ruchaud, 2017), and males also seem to discuss politics more (or at least more directly: Bode, 2017). The greater female focus on caring (65% of family caregivers are female in the USA: Family Caregiver Alliance, 2019), and family (Parker, Horowitz, & Rohal, 2015) has also been found before. In terms of language use, females have often been found to use personal pronouns more in some types of text (Argamon, Koppel, Fine, & Shimoni, 2003).
Conclusions
Although these conclusions are drawn from statistical tests on big data from Twitter, inferences from the results are tentative due to the processing limitations above that could not be addressed and the lack of evidence connecting offline actions to the content of tweets. Thus, for example, the greater female tendency to tweet about families does not prove that females were more concerned about the welfare of their families due to COVID-19, although this is a plausible explanation. Thus, the conclusions should be treated similarly to those of purely qualitative research: as evidence-based ideas but not proof of those ideas. The substantially greater focus of males on sport in tweets about COVID-19 might be taken as evidence that males were less serious about the disease in the initial stages. Irrespective of whether this is true, sport was an important factor in the reaction to COVID-19 for many males. A policy-related suggestion from this is that cancelling sporting events may be particularly effective in communicating to males the seriousness of a situation. For example, if the population is told to socially distance but allowed to attend mass sporting events on the basis that an alternative (watching the event in crowded pubs or bars) is more dangerous then this may send mixed messages since crowded sporting events clearly involve close proximity with large numbers of strangers. Thus, any relaxation of bans on sporting events should be considered very carefully in the future, in countries where they are in place, and sporting bans should be considered in other countries as an important component of social distancing strategies, both for the spreading risk and the message sent to (mainly) males. The results are consistent with, but do not prove, that women are at the forefront of actions to prevent the spread of COVID-19. Public health messages might therefore need to be particularly careful that core messages are transmitted effectively to women in media that they consume so that social distancing is fully understood by as many as possible so that it can be carried out as effectively as possible.