Is Working From Home The New Norm? An Observational Study Based on a Large Geo-tagged COVID-19 Twitter Dataset
Yunhe Feng
Electrical Engineering & Computer Science, University of Tennessee, Knoxville, TN 37996, [email protected]
Wenjun Zhou
Business Analytics & Statistics, University of Tennessee, Knoxville, TN 37996, [email protected]
June 16, 2020

ABSTRACT
As the COVID-19 pandemic swept over the world, people discussed facts, expressed opinions, and shared sentiments on social media. Since the reaction to COVID-19 in different locations may be tied to local cases, government regulations, healthcare resources, and socioeconomic factors, we curated a large geo-tagged Twitter dataset and performed exploratory analysis by location. Specifically, we collected 650,563 unique geo-tagged tweets across the United States (50 states and Washington, D.C.) covering the date range from January 25 to May 10, 2020. Tweet locations enabled us to conduct region-specific studies such as tweeting volumes and sentiment, sometimes in response to local regulations and reported COVID-19 cases. During this period, many people started working from home. The gap between workdays and weekends in hourly tweet volumes inspired us to propose algorithms to estimate work engagement during the COVID-19 crisis. This paper also summarizes themes and topics of tweets in our dataset using both social media exclusive tools (i.e., hashtags and @mentions) and topic modeling.
Dataset link: http://covid19research.site/geo-tagged_twitter_datasets/

Keywords: work from home · stay-at-home order · lockdown · reopen · spatiotemporal analysis · Twitter · COVID-19
The COVID-19 pandemic has had a widespread impact on people's daily lives all over the globe. According to local pandemic conditions, countries worldwide adopted various containment policies to protect their residents and slow down the spread of COVID-19. Although countries like Sweden and South Korea did not lock down cities during the pandemic, most other countries, including China, Italy, Spain, and India, imposed long and stringent lockdowns to restrict gathering and social contact. Within the same country, different strategies and timelines were also set by regions and cities to "flatten the curve" and fight the COVID-19 crisis. People expressed various opinions, attitudes, and emotions about the same COVID-19 regulations due to local hospital resources, economic statuses, demographics, and many other geographic factors. Therefore, it is reasonable and necessary to consider location information when investigating public reactions to COVID-19.

However, it is challenging to conduct such large-scale studies using traditional surveys and questionnaires. First, regulations and policies proposed and enforced in different regions are time-sensitive and changeable, making it hard to determine when surveys should be conducted and which survey questions should be included. For example, California and Tennessee implemented stay-at-home orders on different dates. The initial plans and executive orders could also be tuned promptly, such as extending lockdowns due to fast-growing COVID-19 confirmed cases. Traditional surveys are not flexible enough for such changes. Second, it is time-consuming and expensive to recruit a large number
of participants to take surveys, because demographics (especially geographical locations) must be considered. If a comparative spatial study is conducted, it takes even more time to recruit participants from multiple regions.

In this paper, we built a large geo-tagged Twitter dataset enabling fine-grained investigations of the public reactions to the COVID-19 pandemic. More than 170 million English COVID-19 related tweets were harvested from Jan. 25 to May 10, 2020, among which 650,563 geo-tagged tweets posted within the United States were selected. We took the U.S. as an example to explore public reactions in different regions because states in the U.S. independently determined when, how, and what policies and regulations were imposed. We first presented an overview of both daily and hourly tweet distributions. Then, state-level and county-level geographic patterns of COVID-19 tweets were illustrated. We also proposed algorithms to evaluate work engagement by comparing tweeting behaviors on workdays and weekends. In addition, we extracted the popular topics involved using both social media exclusive tools (i.e., hashtags and @mentions) and topic modeling. Our main contributions are as follows:

• A large geo-tagged COVID-19 Twitter dataset, containing more than 650,000 tweets collected from Jan. 25 to May 10, 2020 in the United States, was built and published at http://covid19research.site/geo-tagged_twitter_datasets/. We listed tweet IDs for all 50 states and Washington, D.C. respectively.

• We profiled geospatial distributions of COVID-19 tweets at multiple location levels, and reported the differences between states after normalizing tweet volumes based on COVID-19 case and death numbers. For example, we found residents in Oregon, Montana, Texas, and California reacted more intensely to the confirmed cases and deaths than those in other states.

• We defined work engagement measurements based on the difference between hourly tweeting volumes on workdays and weekends.
• When studying work engagement patterns after lockdown and reopening, we reported a few interesting findings. For example, New York showed lower work engagement than other states in the first week under stay-at-home orders. The average hourly work engagement in the afternoon (i.e., from 13:00 to 16:59) in the first week of reopening was much higher than in the first week of staying at home.

• We also conducted a comprehensive social sentiment analysis via facial emojis to measure the general public's emotions about stay-at-home orders, reopening, the first/hundredth/thousandth confirmed cases, and the first/hundredth/thousandth deaths. We observed that negative moods dominated the public sentiment over these key COVID-19 events, which showed a similar pattern across states.
In this section, we first provide an overview of daily tweet distributions, demonstrating when COVID-19 tweets went viral. Next, the hourly distributions during different periods are illustrated. We then propose methods to measure work engagement by comparing the hourly tweeting frequencies on workdays and weekends. We also study the influence of COVID-19 regulations, such as stay-at-home orders and reopening, on work engagement.
Figure 1 shows the daily distribution of geo-tagged tweets within the top 10 states with the highest tweet volumes. We can see that daily tweet volumes generated by different states show similar trends. In fact, we tested statistical relationships regarding the daily volumes over time for all pairs of two arbitrary states, and found strong linear correlations among 93.2% of state pairs (Pearson's r > . and p < . ).

Based on key dates, we split the entire observation period into the following three phases.

• Phase 1 (from Jan. 25 to Feb. 24, 31 days): people mentioned little about COVID-19 except for a small peak at the end of January.

• Phase 2 (from Feb. 25 to Mar. 14, 19 days): the number of COVID-19 related tweets began to increase quickly. On Feb. 25, U.S. health officials warned that COVID-19 community spread in America was coming [1]. On March 13, the U.S. declared a national emergency due to COVID-19 [2]. As mentioned in Appendix A.1, we lost around one-third of the detailed tweets between Mar. 18 and Apr. 4 due to corrupted data, but we recorded the daily tweet counts during this period (see the dashed lines in Figure 13). Our crawlers shut down for 8 hours and 9 hours on Mar. 27 and Apr. 23 respectively, which caused the data gaps on those two days.
Figure 1: The daily number of tweets from the top 10 states generating the most tweets.

• Phase 3 (from Mar. 15 to May 10, 57 days): people began to adjust to the new normal caused by COVID-19, such as working from home and city lockdowns.
For each tweet, we converted the UTC time to its local time zone according to the state where it was posted. The aggregated hourly tweeting distributions in different phases are shown in Figure 2. The tweeting behaviors on workdays and weekends were studied separately because we wanted to figure out how working status impacted tweeting patterns. We colored the tweeting frequency gap during business hours (8:00-16:59) green if people tweeted more frequently on weekends than on workdays; otherwise, the hourly gap is colored red.

Figure 2: Hourly distribution in three phases ((a) All, (b) Phase 1, (c) Phase 2, (d) Phase 3). The tweeting frequency gap during business hours is colored green if the hourly frequency on weekends is higher than on workdays; otherwise, the gap is colored red.

In Phase 1, there existed a tweeting gap from 8:00 to 16:59 between workdays and weekends. The tweeting peak occurred at 12:00-12:59 on weekends but at 17:00-17:59 on workdays. We think this may be explained by the fact that people engage at work during regular working hours and have little time to post tweets on workdays, but become free to express concerns about COVID-19 on Twitter after work.

The hourly distribution patterns changed in Phase 2, when confirmed COVID-19 cases increased quickly in the United States. People posted COVID-19 tweets more frequently during business hours than in the same time slots on weekends, indicating COVID-19 had drawn great attention from workers while they were working.

It is interesting to note that a green tweeting gap from 8:00 to 16:59 reappeared in Phase 3, when most people had started working from home. These findings motivated us to take advantage of the tweeting frequencies on workdays and weekends to estimate work engagement during the COVID-19 crisis (see Section 4).
Twitter users can tag tweets with general locations (e.g., city, neighborhood) or exact GPS locations. In this section, we utilized the two types of tweet locations to explore geographic patterns of COVID-19 tweets at the state and county levels.

For states spanning multiple time zones, we took the time zone covering most areas inside the state. For example, we used Eastern Standard Time (EST) when processing tweets from Michigan because EST is adopted by most of the state. Except for Arizona and Hawaii, we switched to Daylight Saving Time (DST) for all states after Mar. 8, 2020.
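The per-state local-time conversion described above can be sketched as follows; the state-to-time-zone mapping shown here is a small illustrative subset (following the majority-area rule, e.g., Michigan mapped to Eastern time), not the full table used in the paper:

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo

# Illustrative subset: each state mapped to the zone covering most of its area.
STATE_TZ = {
    "NY": "America/New_York",
    "MI": "America/New_York",   # most of Michigan observes Eastern time
    "TX": "America/Chicago",
    "CA": "America/Los_Angeles",
    "AZ": "America/Phoenix",    # Arizona does not observe DST
}

def local_hour(utc_dt: datetime, state: str) -> int:
    """Convert a UTC tweet timestamp to the local hour (0-23) of its state."""
    tz = ZoneInfo(STATE_TZ[state])
    return utc_dt.replace(tzinfo=timezone.utc).astimezone(tz).hour
```

Because the `zoneinfo` database knows the DST switch dates, the March 8, 2020 transition is handled automatically for the states that observe it.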
We extracted the state information from both general and exact tweet locations and calculated tweet volume percentages for each state, as shown in Figure 3(a). The most populated states, i.e., California, Texas, New York, and Florida, contributed the most tweets. In contrast, less populated states such as Wyoming, Montana, North Dakota, and South Dakota created the fewest tweets. We measured the relationship between tweet volumes and populations for all states, and found a strong linear correlation (Pearson's r = 0. and p < . ).

Then we normalized tweet volumes using state residential populations. Figure 3(b) illustrates that Washington, D.C. posted the highest volume of tweets per 1,000 residents, followed by Nevada, New York, California, and Maryland. The remaining states demonstrate similar patterns. We think the top ranking of Washington, D.C. might be caused by its role as a political center, where COVID-19 news and policies were spawned.

Unlike state populations, we did not find strong correlations between tweet counts and cumulative confirmed COVID-19 cases (Pearson's r = 0. and p < . ) or deaths (Pearson's r = 0. and p < . ). We further normalized tweet volumes based on the cumulative number of COVID-19 cases and deaths in each state. Figure 3(c) and Figure 3(d) show the average number of tweets generated per COVID-19 case and per death, respectively. Note that Hawaii and Alaska (not plotted in Figure 3(c) and Figure 3(d)) ranked first and second in both scenarios. Residents in states like Oregon, Montana, Texas, and California reacted sensitively to both confirmed cases and deaths, as these states dominated in Figure 3(c) and Figure 3(d).

(a) Tweeting percentage in each state
Figure 3: State-level geospatial distribution across the United States
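Each correlation test above pairs one per-state series (tweet counts) with another (population, case, or death counts). A minimal Pearson's r computation looks like the following; the five state values are made-up toy numbers, purely to illustrate the calculation, not the paper's data:

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Toy example: tweet counts vs. populations (millions) for five hypothetical states
tweets = [120_000, 75_000, 90_000, 30_000, 5_000]
population = [39.5, 29.0, 19.5, 10.6, 0.6]
r = pearson_r(tweets, population)
```

In practice `scipy.stats.pearsonr` also returns the accompanying p-value; the hand-rolled version is shown only to make the statistic explicit.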
We utilized GPS locations to profile the geographic distribution of COVID-19 tweets at the county level because general tweet locations might not contain county information. In our collected geo-tagged tweets, 3.95% contained GPS locations. We resorted to Nominatim [3], a search engine for OpenStreetMap data, to identify the county where each tweet's GPS coordinate lay. Figure 4(a) and Figure 4(b) visualize the raw GPS coordinates and the corresponding county distributions. Large cities in each state demonstrated a higher tweeting density than small ones. In fact, we found a strong correlation between GPS-tagged tweet counts and county populations (Pearson's r = 0. and p < . ).
But such correlations did not hold for cumulative confirmed COVID-19 cases (Pearson's r = 0. and p < . ) or deaths (Pearson's r = 0. and p < . ).

Figure 4: Distribution of tweets tagged with exact GPS coordinates at the county level: (a) exact (lat, lon) coordinates; (b) geospatial distribution by county.
In this section, we first propose methods to measure hourly and daily work engagement. Then, we investigate how stay-at-home orders and reopening influenced hourly and daily work engagement, respectively. Note that we use the term "lockdown" to refer to stay-at-home orders in this section. The lockdown dates and reopening dates for each state were retrieved from Wikipedia [4] and the New York Times [5], respectively.
We assumed that (1) people would tweet less frequently during working hours if they engaged more in their work tasks; and (2) if people spent no time on work tasks during business hours on workdays, their tweeting behaviors would be the same as on weekends, especially when people were confined at home. We think the two assumptions are intuitive and reasonable, as both Phase 1 and Phase 3 showed meaningful working-hour tweeting gaps in Figure 2(b) and Figure 2(d).

We took the tweeting gap size as an indicator of work engagement. More specifically, let h_i^j denote the tweet volume at the i-th hour on the j-th day of a week. For example, h_8^2 means the number of tweets posted from 8:00 to 8:59 on Tuesdays (we took Monday as the first day of the week). Accordingly, the total number of tweets on the j-th day of a week is T_j = Σ_{i=0}^{23} h_i^j. The total tweet volumes on workdays and weekends can be expressed as T_workday = Σ_{j=1}^{5} T_j and T_weekend = Σ_{j=6}^{7} T_j.

Note that we estimated both hourly and daily work engagement by considering at least seven days, because data sparsity would lead to unreliable results if fewer days were involved. The work engagement at the i-th hour, H(i), is defined as the ratio of the normalized tweeting frequency at the i-th hour on weekends over that on workdays, minus one, as expressed in Equation 1:

    H(i) = [ (Σ_{j=6}^{7} h_i^j) / T_weekend ] / [ (Σ_{j=1}^{5} h_i^j) / T_workday ] − 1,    (1)

where i ∈ {8, 9, 10, 11, 12, 13, 14, 15, 16}. A larger positive H(i) indicates higher work engagement. When H(i) equals 0, there is no difference in work engagement at the i-th hour between workdays and weekends. A positive value of H(i) implies people are more engaged in work on workdays than on weekends. Although it is rare, a negative H(i) means people fail to focus more on their work on workdays than on weekends.
We also defined the daily (from Monday to Friday only) work engagement by aggregating tweeting frequencies in regular working hours (from 8:00 to 16:59). The daily work engagement on the j-th day is expressed as:

    D(j) = [ (Σ_{j'=6}^{7} Σ_{i=8}^{16} h_i^{j'}) / T_weekend ] / [ (Σ_{i=8}^{16} h_i^j) / T_j ] − 1,    (2)

where j ∈ {1, 2, 3, 4, 5} and T_j is the total tweet count on the j-th day. Similar to hourly engagement, a larger positive D(j) means higher work engagement.

We chose the ten states that generated the most massive tweet volumes from 8:00 to 16:59 on each day of the first week after stay-at-home orders were enforced to study hourly and daily work engagement. Table 1 illustrates the hourly work engagement of the ten states. Except for California and New York, all other eight states had positive average work engagement scores (see the second-to-last column in Table 1), implying people worked more intensively on workdays than on weekends. Georgia and Maryland demonstrated relatively higher average work engagement (> . ). Across all ten states, people focused more on work tasks at 10:00 and 13:00 than in other hour slots, and reached the lowest engagement score at 11:00 (see the second-to-last row in Table 1).

Table 1: Hourly work engagement scores (first stay-at-home week of each state)
State  Date    #Tweets  8:00    9:00    10:00   11:00   12:00   13:00   14:00   15:00   16:00   Avg.    Std.
CA     Mar 19  8,091    -0.016  -0.063  0.206   0.081   -0.018  0.136   -0.178  -0.184  0.009   -0.003  0.131
TX     Apr 2   6,758    -0.109  0.083   0.195   0.104   0.272   0.256   -0.04   0.047   0.099   0.101   0.127
FL     Apr 3   5,582    -0.123  0.057   0.367   -0.092  0.209   0.115   0.279   0.097   0.374   0.143   0.181
NY     Mar 22  4,213    -0.366  0.025   0.065   0.164   0.174   0.157   -0.241  -0.056  -0.281  -0.040  0.208
GA     Apr 3   2,334    -0.226  -0.108  0.328   -0.347  0.053   0.388   0.831   0.473   0.000   0.155   0.378
PA     Apr 1   2,327    -0.078  0.368   0.144   0.201   -0.098  0.650   -0.214  0.092   0.094   0.129   0.262
IL     Mar 21  1,639    0.337   -0.096  0.321   -0.296  0.252   -0.038  0.242   -0.089  0.033   0.074   0.223
MD     Mar 30  1,598    0.151   0.266   1.275   -0.030  -0.336  -0.036  -0.073  0.455   0.767   0.271   0.497
VA     Mar 30  1,595    0.460   0.065   0.792   -0.200  0.024   0.135   -0.240  0.182   -0.089  0.125   0.328
AZ     Mar 31  1,508    0.381   -0.097  -0.160  -0.020  -0.038  0.624   -0.185  0.674   0.054   0.137   0.334
Avg.
Std.
Table 2: Daily work engagement scores (first stay-at-home week of each state)

State  Date    #Tweets  Mon.    Tue.    Wed.    Thu.    Fri.    Avg.    Std.
CA     Mar 19  8,091    -0.005  -0.042  -0.032  0.026   -0.001  -0.011  0.027
TX     Apr 2   6,758    0.097   0.045   0.001   0.383   0.083   0.122   0.151
FL     Apr 3   5,582    0.004   0.093   0.093   0.352   0.260   0.160   0.142
NY     Mar 22  4,213    0.036   -0.103  -0.07   0.076   -0.115  -0.035  0.086
GA     Apr 3   2,334    0.103   0.027   0.009   0.388   0.138   0.133   0.152
PA     Apr 1   2,327    0.115   0.013   0.009   0.336   0.201   0.135   0.138
IL     Mar 21  1,639    0.020   0.021   0.112   0.088   -0.009  0.046   0.051
MD     Mar 30  1,598    -0.047  0.311   0.117   0.410   0.668   0.292   0.274
VA     Mar 30  1,595    -0.097  0.190   0.127   0.112   0.124   0.091   0.110
AZ     Mar 31  1,508    0.291   0.051   0.140   0.008   0.103   0.119   0.109
Avg.
Std.
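The hourly and daily engagement measures in Equations 1 and 2 can be sketched in a few lines of Python; the tweet counts used in the usage note below are synthetic, purely to illustrate the computation:

```python
from collections import defaultdict

HOURS = range(8, 17)  # business hours 8:00-16:59

def hourly_engagement(h):
    """H(i) from Eq. 1. h[(j, i)] = tweet count at hour i (0-23) on weekday j
    (1=Mon ... 7=Sun), aggregated over at least one full week.
    Use a defaultdict(int) so missing (day, hour) slots count as zero."""
    t_work = sum(c for (j, _), c in h.items() if j <= 5)
    t_end = sum(c for (j, _), c in h.items() if j >= 6)
    H = {}
    for i in HOURS:
        weekend = sum(h[(j, i)] for j in (6, 7)) / t_end
        workday = sum(h[(j, i)] for j in range(1, 6)) / t_work
        H[i] = weekend / workday - 1
    return H

def daily_engagement(h):
    """D(j) from Eq. 2 for workdays j = 1..5."""
    t_end = sum(c for (j, _), c in h.items() if j >= 6)
    weekend = sum(h[(j, i)] for j in (6, 7) for i in HOURS) / t_end
    D = {}
    for j in range(1, 6):
        t_j = sum(h[(j, i)] for i in range(24))
        workday = sum(h[(j, i)] for i in HOURS) / t_j
        D[j] = weekend / workday - 1
    return D
```

For example, with a uniform 10 tweets per hour everywhere except workday business hours suppressed to 5 (i.e., people tweeting half as often while working), both H(i) for any business hour and D(j) for any workday come out to 0.625, a positive engagement score as the definitions intend.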
As shown in Figure 5, the hourly and daily work engagement patterns of the same state are very similar across the nine weeks. States performed very differently one month before local lockdowns (see x = -4 and x = -5 in Figure 5(a) and Figure 5(b)). Surprisingly, most states achieved higher work engagement in the first two weeks of lockdowns (see x = Lockdown and x = 1 in Figure 5(a) and Figure 5(b)) than before lockdowns.

(a) Average hourly work engagement per week
(b) Average daily work engagement per week
Figure 5: Average hourly and daily work engagement in the five weeks before and three weeks after local stay-at-home orders took effect
Some states had started to reopen partially by the end of April. We selected the states that partially reopened before May 3 to investigate their hourly and daily work engagement in the first week of reopening. As Table 3 and Table 4 show, the average hourly and daily work engagement of the nine states, except Alaska, is positive. People demonstrated much higher work engagement in the afternoon than in the morning (see the second-to-last row in Table 3). Figure 6 demonstrates that the afternoon work engagement during reopening is much larger than its counterpart in the first week of lockdowns. Also, the average work engagement during reopening on Tuesday and Friday improves considerably compared with lockdowns.

Table 3: Hourly work engagement in the first week after reopening
State  Date    #Tweets  8:00    9:00    10:00   11:00   12:00   13:00   14:00   15:00   16:00   Avg.    Std.
TX     May 1   5,927    -0.308  0.215   -0.187  0.010   0.170   0.125   0.913   0.367   -0.098  0.134   0.360
GA     May 1   2,049    -0.294  0.068   0.231   0.015   -0.011  1.105   0.140   0.567   0.753   0.286   0.438
TN     May 1   1,280    0.236   -0.119  -0.116  0.242   -0.129  0.105   1.895   0.535   -0.061  0.288   0.643
CO     Apr 27  990      -0.316  0.434   0.092   0.703   -0.206  0.300   -0.230  -0.133  0.021   0.074   0.344
AL     May 1   603      1.453   -0.328  -0.398  0.472   -0.097  2.753   3.047   0.104   -0.146  0.762   1.336
MS     Apr 28  317      0.204   -0.518  -0.037  -0.484  -0.259  1.108   2.372   1.409   3.014   0.757   1.293
ID     May 1   184      0.523   0.692   -0.805  0.587   0.523   3.231   -1.000  -0.154  -0.683  0.324   1.275
AK     Apr 25  142      0.898   0.898   -1.000  -1.000  -0.051  -1.000  -0.526  -0.431  -0.209  -0.269  0.748
MT     Apr 27  106      1.786   -0.443  -0.071  0.114   1.786   -1.000  -1.000  -0.071  0.671   0.197   1.043
Avg.
Std.
In this section, we summarize and reveal the themes people discussed on Twitter using social network exclusive tools (i.e., hashtags and @mentions) and topic modeling.

We made sure each reopened state had at least seven days of tweets after its reopening in our dataset (Jan. 25 - May 10).
Table 4: Daily work engagement in the first week of reopening
State  Date    #Tweets  Mon.    Tue.    Wed.    Thu.    Fri.    Avg.    Std.
TX     May 1   5,927    0.053   0.196   -0.053  0.013   0.521   0.146   0.229
GA     May 1   2,049    0.099   0.362   0.168   0.158   0.504   0.258   0.169
TN     May 1   1,280    0.016   0.324   0.020   -0.019  0.958   0.260   0.414
CO     Apr 27  990      -0.213  0.334   -0.077  0.049   0.502   0.119   0.294
AL     May 1   603      0.321   0.367   0.230   0.062   0.660   0.328   0.219
MS     Apr 28  317      0.310   0.787   0.239   0.226   0.245   0.361   0.240
ID     May 1   184      1.110   0.327   -0.282  -0.231  0.108   0.206   0.564
AK     Apr 25  142      -0.486  -0.020  -0.449  -0.327  -0.327  -0.322  0.183
MT     Apr 27  106      -0.265  0.429   0.224   -0.095  0.457   0.150   0.320
Avg.
Std.

(a) Average hourly work engagement in the first week
(b) Average daily work engagement in the first week
Figure 6: Average hourly and daily work engagement in the first week of lockdowns and reopening
Hashtags are widely used on social networks to categorize topics and increase engagement. According to Twitter, hashtagged words that become very popular are often trending topics. We found that the most popular hashtags (Figure 7) fell mainly into the COVID-19, healthcare, place, and politics categories.
People use @mentions to get someone's attention on social networks. We found most of the frequent mentions were of politicians and news media, as illustrated in Figure 8. The mention of @realDonaldTrump accounted for 4.5% of all mentions and was the most popular one. To make Figure 8 more readable, @realDonaldTrump is not plotted. Other national (e.g., @VP and @JoeBiden) and regional (e.g., @NYGovCuomo and @GavinNewsom) politicians were mentioned many times. As news channels played a crucial role in broadcasting updated COVID-19 news and policies to the public, it is not surprising to observe news media such as @CNN, @FoxNews, @nytimes, and @YouTube are prevalent in Figure 8. In addition, the World Health Organization (@WHO), the beer brand (@corona), and Elon Musk (@elonmusk) were among the top 40 mentions.
To further explore what people tweeted, we adopted latent Dirichlet allocation (LDA) [6] to infer coherent topics from plain-text tweets. We created a tweet corpus by treating each unique tweet as one document. Commonly used text preprocessing techniques, such as tokenization, lemmatization, and stop-word removal, were then applied to each document to improve modeling performance. Next, we performed term frequency-inverse document frequency
Figure 7: Top 40 most popular hashtags, grouped into COVID-19, Healthcare, Place, Politics, and Others categories.
Figure 8: The 40 most frequently mentioned Twitter accounts. The most popular mention, @realDonaldTrump (accounting for more than 4.5%), is not displayed.

(TF-IDF) on the whole tweet corpus to assign higher weights to the most important words. Finally, the LDA model was applied to the TF-IDF corpus to extract latent topics.

We determined the optimal number of topics in LDA using the C_v metric, which was reported as the best coherence measure, combining normalized pointwise mutual information (NPMI) and cosine similarity [7]. For each topic number, we trained 500-pass LDA models ten times. We found the average C_v scores demonstrated an increasing trend as the topic number became larger, but the increase became relatively slow when more than ten topics were considered. Therefore, we chose ten as the most suitable topic number for our study.

The ten topics and the words in each topic are illustrated in Table 5. We can see that Topic 1 is mostly related to statistics of COVID-19, such as deaths, cases, tests, and rates. In the healthcare topic, treatment-related words, e.g., "mask", "patient", "hospital", "nurse", "medical", and "PPE", are clustered together. Topic 3 is about politics, as top keywords include "Trump", "president", "vote", and "democratic". The emotion topic mainly consists of informal language expressing emotions. Topic 5 is related to impacts on work, businesses, and schools. We believe Americans who are bilingual in Spanish and English contributed to the Spanish topic. Topic 7 calls for unity in the community. In the topic of places, many states (e.g., Florida and California) and cities (e.g., New York and San Francisco) are mentioned. The last two topics are about praying and home activities when people followed stay-at-home orders. These topics are very informative and summarize the overall conversations on social media well.
Table 5: Top 40 keywords for the 10 topics extracted using the LDA topic model

Rank   Topic 1  Topic 2  Topic 3  Topic 4  Topic 5  Topic 6  Topic 7  Topic 8  Topic 9  Topic 10
Facts Healthcare Politics Emotion Business Spanish Community Location Praying Activities
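The preprocessing and TF-IDF weighting step described above can be sketched in plain Python; the tiny corpus and stop-word list are illustrative only, and the LDA stage itself (which in practice would come from a library such as gensim) is omitted:

```python
import math
import re
from collections import Counter

# Illustrative stop-word list; real pipelines use a much larger one.
STOPWORDS = {"the", "a", "an", "is", "are", "of", "to", "and", "in", "on"}

def tokenize(doc: str):
    """Lowercase, keep alphabetic tokens, drop stop words."""
    return [w for w in re.findall(r"[a-z']+", doc.lower()) if w not in STOPWORDS]

def tfidf(corpus):
    """Return one {term: tf-idf weight} dict per document."""
    docs = [tokenize(d) for d in corpus]
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))  # document frequency
    out = []
    for d in docs:
        tf = Counter(d)
        out.append({t: (c / len(d)) * math.log(n / df[t]) for t, c in tf.items()})
    return out

corpus = [
    "stay home and wash your hands",
    "stay home save lives",
    "the hospital needs masks and ppe",
]
weights = tfidf(corpus)
```

Terms that appear in every document receive an inverse-document-frequency of zero, which is exactly the down-weighting of ubiquitous words that the paper relies on before fitting LDA.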
In this section, we conducted a comprehensive sentiment analysis from three aspects. First, the overall public emotions were investigated using polarized words and facial emojis. Then, we studied how sentiment changed over time at the national and state levels during the COVID-19 pandemic. Finally, event-specific emotions were reported.
Emotionally polarized words or sentences express either positive or negative emotions. We leveraged TextBlob [8] to estimate the sentiment polarity of words and tweets. For each word, TextBlob offers a subjectivity score within the range [0.0, 1.0], where 0.0 is most objective and 1.0 is most subjective, and a polarity score within the range [-1.0, 1.0], where -1.0 is the most negative and 1.0 is the most positive. We used the subjectivity threshold to filter out objective tweets, and used the polarity threshold to determine the sentiment. For example, a subjectivity threshold of 0.5 would
only select the tweets with a subjectivity score greater than 0.5 as candidates for polarity checking. A polarity threshold of 0.7 treats tweets with a polarity score greater than 0.7 as positive and those with a polarity less than -0.7 as negative.

Figure 9 illustrates the ratio of the number of positive tweets over negative ones under different combinations of subjectivity and polarity thresholds. Positive and negative emotions evenly match each other when the ratio equals one. We can see that emotion patterns change along with threshold settings. Specifically, positive emotions dominated on Twitter with small polarity and subjectivity thresholds. However, negative emotions began to overshadow the positive ones under large polarity and subjectivity thresholds. Figure 10 shows three examples of polarized word clouds where the ratio was greater than 1 (subjectivity=0.2, polarity=0.7), equal to 1 (subjectivity=0.8, polarity=0.2), and less than 1 (subjectivity=0.8, polarity=0.7).
Figure 9: The ratio of positive to negative tweets under different subjectivity and polarity thresholds.

Figure 10: Polarized word clouds of different positive/negative ratios: (a) Ratio > 1, generated with thresholds (subjectivity=0.2, polarity=0.7); (b) Ratio ≈ 1, with (subjectivity=0.8, polarity=0.2); and (c) Ratio < 1, with (subjectivity=0.8, polarity=0.7).
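The threshold-based counting behind Figure 9 can be sketched as follows; the (subjectivity, polarity) pairs are hypothetical stand-ins for the per-tweet scores that TextBlob's `sentiment` property would return:

```python
def pos_neg_ratio(scores, subj_th, pol_th):
    """Ratio of positive to negative tweets among sufficiently subjective ones.

    scores: iterable of (subjectivity, polarity) pairs, e.g. one per tweet
    from TextBlob(tweet).sentiment.
    """
    pos = neg = 0
    for subjectivity, polarity in scores:
        if subjectivity <= subj_th:   # too objective: excluded from polarity check
            continue
        if polarity > pol_th:
            pos += 1
        elif polarity < -pol_th:
            neg += 1
    return pos / neg if neg else float("inf")

# Hypothetical scores for six tweets
scores = [(0.9, 0.8), (0.7, -0.9), (0.6, 0.75), (0.3, 0.9), (0.8, 0.1), (0.9, -0.72)]
```

Sweeping `subj_th` and `pol_th` over a grid of values and recording the ratio at each cell reproduces the kind of heatmap shown in Figure 9.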
Besides polarized-text based sentiment analysis, we took advantage of facial emojis to further study public emotions. Facial emojis are suitable for measuring tweet sentiment because they are ubiquitous on social media, conveying diverse positive, neutral, and negative feelings. We grouped the sub-categories of facial emojis suggested by the Unicode Consortium into positive, neutral, and negative categories. Specifically, all face-smiling, face-affection, face-tongue, and face-hat emojis were regarded as positive; all face-neutral-skeptical and face-glasses emojis were grouped as neutral; and all face-sleepy, face-unwell, face-concerned, and face-negative emojis were treated as negative. A full list of our emoji emotion categories is available at http://covid19research.site/emoji-category/.

We detected 4,739 (35.2%) positive emojis, 2,438 (18.1%) neutral emojis, and 6,271 (46.6%) negative emojis in our dataset. Negative emojis accounted for almost half of all emoji usages. Table 6 illustrates the top emojis by sentiment category with their usage frequencies. The most frequent emojis in the three categories were very representative. As expected, the overall most popular emoji remained so in our dataset, which is consistent with many other recent research findings [9, 10]. The thinking face emoji was the most widely used neutral facial emoji, indicating people were puzzled about COVID-19. Surprisingly, the face-with-medical-mask emoji ranked higher than any other negative emoji. The skull emoji appeared more frequently than almost all positive and neutral emojis. We think the sneezing face and hot face emojis are very likely related to suspected symptoms of COVID-19.
Table 6: Top emojis by sentiment categories (numbers represent frequency)

Positive: 1608, 522, 199, 189, 176, 173, 171, 162, 153, 150, 139, 138, 132, 95, 91, 77, 77, 67, 59, 56, 52, 50, 49, 44, 28, 27
Neutral: 865, 591, 197, 146, 143, 104, 104, 87, 81, 43, 36, 27, 14
Negative: 1167, 629, 446, 401, 394, 368, 251, 245, 227, 207, 184, 159, 145, 117, 104, 102, 95, 75, 70, 70, 68, 68, 63, 59, 54, 47
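The Unicode sub-category grouping described above can be sketched as follows; the per-category emoji sets here are small illustrative samples, not our full published list:

```python
from collections import Counter

# Illustrative samples of the Unicode facial-emoji sub-categories;
# the full mapping covers every face-* sub-category.
POSITIVE = {"😀", "😂", "🥰", "😋", "🤠"}   # face-smiling, face-affection, ...
NEUTRAL = {"🤔", "😐", "🧐"}                # face-neutral-skeptical, face-glasses
NEGATIVE = {"😷", "😢", "😡", "💀"}          # face-unwell, face-concerned, face-negative

def emoji_sentiment_shares(text: str) -> dict:
    """Count categorized facial emojis in text and return each category's share."""
    counts = Counter()
    for ch in text:
        if ch in POSITIVE:
            counts["positive"] += 1
        elif ch in NEUTRAL:
            counts["neutral"] += 1
        elif ch in NEGATIVE:
            counts["negative"] += 1
    total = sum(counts.values())
    return {k: counts[k] / total for k in counts} if total else {}
```

Running this over the concatenated tweet text yields the positive/neutral/negative percentages reported above (35.2% / 18.1% / 46.6% in our full dataset).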
We used facial emojis to track different types of public sentiment during the COVID-19 pandemic. Figure 11(a) shows the daily overall emotions aggregated across all states. In Phase 1 (Jan. 25 to Feb. 24), public emotions fluctuated over large ranges due to data sparsity. In Phase 2 (Feb. 25 to Mar. 14), positive and negative emotions overshadowed each other dynamically but demonstrated stable trends. In Phase 3 (Mar. 15 to May 10), negative sentiment dominated both positive and neutral emotions, expressing the public's concerns about COVID-19.

We also investigated the daily positive, neutral, and negative sentiment of different states, as presented in Figures 11(b), 11(c), and 11(d), respectively. The top five states by tweet volume, i.e., CA, TX, NY, FL, and PA, were taken as examples. Similar to the Phase 1 patterns in Figure 11(a), the emotions expressed by people in different states varied greatly. In Phase 2 and Phase 3, the five states demonstrated similar positive, neutral, and negative patterns on most dates, as their sentiment percentages clustered together and even overlapped. However, there were state-specific emotion outliers in Phase 2 and Phase 3. For example, positive sentiment went up to 70% in PA when the Allegheny County Health Department (ACHD) announced there were no confirmed cases of COVID-19 in Pennsylvania on Mar. 5. People in New York state expressed more than 75% neutral sentiment on Feb. 27, when the New York City Health Department announced that it was investigating a possible COVID-19 case in the city. On Mar. 17 and Mar. 18, residents of PA demonstrated almost 100% negative sentiment as statewide confirmed COVID-19 cases climbed to 100.
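The daily sentiment percentages plotted in Figure 11 can be computed with a simple aggregation like the sketch below (the function name and the `(date, sentiment)` record format are our assumptions; the paper does not publish its aggregation code):

```python
from collections import defaultdict

def daily_sentiment_percentages(records):
    """records: iterable of (date_str, sentiment) pairs, with sentiment in
    {'positive', 'neutral', 'negative'}. Returns {date: {sentiment: percent}}."""
    per_day = defaultdict(lambda: defaultdict(int))
    for date, sentiment in records:
        per_day[date][sentiment] += 1
    result = {}
    for date, counts in per_day.items():
        total = sum(counts.values())
        result[date] = {s: 100.0 * counts[s] / total
                        for s in ("positive", "neutral", "negative")}
    return result

pcts = daily_sentiment_percentages([
    ("2020-03-17", "negative"),
    ("2020-03-17", "negative"),
    ("2020-03-17", "positive"),
    ("2020-03-05", "positive"),
])
```

The same aggregation yields the per-state curves when records are first filtered by the tweet's state.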
We studied event-specific sentiment by aggregating tweets posted from different states when the same critical COVID-19 events occurred. We focused on the following eight events:

• the first, the 100th, and the 1,000th confirmed COVID-19 cases;
• the first, the 100th, and the 1,000th confirmed COVID-19 deaths;
• lockdown;
• reopening.

For the first seven events, we aggregated the tweets in CA, TX, FL, NY, GA, PA, IL, MD, VA, and AZ, which were also studied in Subsection 4.1.2. For the last event, we investigated the nine states of TX, GA, TN, CO, AL, MS, ID, AK, and MT, consistent with Subsection 4.1.2. To our surprise, the average percentages of each sentiment type in the eight events demonstrated similar patterns, as shown in Figure 12. We carried out a one-way multivariate analysis of variance (MANOVA) and found the p-value was nearly 1.0, indicating there was no significant difference among these eight event-specific sentiments. When the first case, the 100th case, and the first death were confirmed, sentiment standard deviations were much larger than for the remaining events, suggesting that people in different states expressed varying and diverse sentiments at the beginning of the COVID-19 outbreak. Negative emotion reached its highest level among all events when the 1,000th death was reported. Positive emotion reached its highest level among all events when states began to reopen.

Figure 11: Emotion distribution by day. (a) Daily overall sentiment aggregated across all states; (b) daily positive sentiment by state (annotated events: ACHD announced no confirmed cases in PA; bars and nightclubs closed in FL; more people leaving hospitals than arriving in NY); (c) daily neutral sentiment by state (annotated events: investigating a possible case in NYC; statewide school closures in TX; expanded openings of businesses and activities in TX; protesters rally against quarantines in PA); (d) daily negative sentiment by state (annotated events: cases climb to 100 in PA; PAUSE order extended in NYS; death toll neared 1,000 in CA).

This paper presents a large public geo-tagged COVID-19 Twitter dataset containing 650,563 unique geo-tagged COVID-19 tweets posted in the United States from Jan. 25 to May 10. A small number of tweets were missing during the data collection period due to corrupted files and intermittent internet connectivity issues. We compensated for the data gaps using the COVID-19 dataset collected by Chen et al. [11]. Because different COVID-19 keywords were used in [11] and in our study to filter the tweet stream, this did not compensate for the missing data perfectly. However, given the small
proportion of missing data, we do not expect the conclusions to change. For more details about our dataset, please refer to Appendix A.

Figure 12: Event-specific sentiments for the eight events (first case, first death, 100 cases, 100 deaths, 1,000 cases, 1,000 deaths, lockdown, and reopening). The means of positive, neutral, and negative emotions are very close but have different standard deviations.

Based on the geo-tagged dataset, we investigated fine-grained public reactions during the COVID-19 pandemic. First, we studied the daily tweeting patterns in different states and found that most state pairs had a strong linear correlation. The local time zones inferred from tweet locations made it possible to compare hourly tweeting behaviors on workdays and weekends. Their different hourly patterns between 8:00 and 17:00 inspired us to propose approaches to measure work engagement. Second, we utilized tweet locations to explore the geographic distribution of COVID-19 tweets at the state and county levels. Third, we summarized and revealed the themes people discussed on Twitter using both social network exclusive tools (i.e.,
Appendix A Dataset
In this section, we first describe how we collected the Twitter data and compensated for data gaps. We then describe how we removed Twitter bots to improve the quality of the data analytics. Finally, we explain how we extracted the U.S. geo-tagged COVID-19 tweets from the general tweets.
A.1 Data Collection
We utilized Twitter's Streaming APIs to crawl real-time tweets containing a set of keywords related to the novel coronavirus outbreak, i.e., "coronavirus", "wuhan", "corona", and "nCoV", starting on January 25, 2020. After the World Health Organization (WHO) announced the official name COVID-19 on February 11, 2020, we added "COVID19" and "COVID-19" to the keyword list. The known data gaps in our collection are listed at http://covid19research.site/geo-tagged_twitter_datasets/known_data_gaps.csv. To compensate for these data gaps, we turned to the COVID-19 dataset maintained by Chen et al. [11] and downloaded 16,459,659 tweets.

Figure 13: The daily number of tweets from the top 10 states generating the most tweets. (The collection start date, Jan. 25, is two days after the Wuhan lockdown.)
A.2 Data Cleaning
One of the challenges in dealing with messy text like tweets is removing noisy data generated by Twitter bots. Inspired by the bot detection approach proposed in [12], we treated two types of Twitter users as bots: (1) those who posted more than 5,000 COVID-19 tweets (more than 46 tweets per day on average) during our data collection period; and (2) those who posted over 1,000 COVID-19 tweets in total and whose top three most frequent posting intervals covered at least 90% of their tweets. For these two types of bots, we removed 317,101 tweets created by 32 bots and 120,932 tweets created by 36 bots, respectively.
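The two heuristics above can be sketched as follows. The function name, the representation of posting times as timestamps in seconds, and reading "posting intervals" as gaps between consecutive tweets are our assumptions; the paper does not publish its filtering code.

```python
from collections import Counter

def is_bot(timestamps, total_threshold=5000, interval_threshold=1000,
           top_k=3, coverage=0.9):
    """Flag a user as a bot under the two heuristics described above.

    timestamps: sorted posting times (seconds) of one user's COVID-19 tweets.
    Rule 1: more than `total_threshold` tweets over the collection period.
    Rule 2: more than `interval_threshold` tweets, and the top-3 most frequent
            posting intervals cover at least 90% of the tweets.
    """
    n = len(timestamps)
    if n > total_threshold:
        return True
    if n > interval_threshold:
        # Gaps between consecutive tweets, counted by frequency.
        intervals = Counter(b - a for a, b in zip(timestamps, timestamps[1:]))
        top = sum(count for _, count in intervals.most_common(top_k))
        if top >= coverage * n:
            return True
    return False

is_bot([i * 3600 for i in range(1200)])  # posts exactly every hour, 1,200 times -> flagged
```

An account posting on a rigid schedule (a single dominant interval) is flagged by Rule 2 even though it stays under the Rule 1 volume threshold.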
A.3 Geo-tagged Data in the U.S.
Twitter allows users to optionally tag tweets with geographic information at different levels of precision, indicating the real-time location of the user when tweeting. A typical tweet location is either a bounding-box polygon of coordinates specifying a general area, such as a city or neighborhood, or an exact GPS latitude-longitude coordinate. We detected and examined the "place" attribute in the collected tweet JSON files. If the embedded "country_code" was "US" and the extracted state was among the 50 states and Washington, D.C., we added the tweet to our geo-tagged dataset. After removing retweets, 650,563 unique geo-tagged tweets from 246,032 users in the United States were collected. Among them, 38,818 tweets (5.96% of our dataset) were retrieved from the dataset proposed by Chen et al. [11]. The monthly number of geo-tagged tweets in each state is shown in Figure 14.
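A minimal sketch of this filtering step is shown below. It assumes the state code can be read from the suffix of the place's `full_name` field (e.g., "Knoxville, TN"), which is typical for U.S. city-level places in Twitter's JSON; the paper does not spell out its exact state-extraction method.

```python
US_STATES = {
    "AL","AK","AZ","AR","CA","CO","CT","DE","DC","FL","GA","HI","ID","IL","IN",
    "IA","KS","KY","LA","ME","MD","MA","MI","MN","MS","MO","MT","NE","NV","NH",
    "NJ","NM","NY","NC","ND","OH","OK","OR","PA","RI","SC","SD","TN","TX","UT",
    "VT","VA","WA","WV","WI","WY",
}

def extract_us_state(tweet):
    """Return the two-letter state code of a geo-tagged U.S. tweet, or None.

    Relies on the 'place' attribute of the tweet JSON object: the tweet is kept
    only if country_code is 'US' and the place name ends with a known state code.
    """
    place = tweet.get("place") or {}
    if place.get("country_code") != "US":
        return None
    state = place.get("full_name", "").rsplit(", ", 1)[-1]
    return state if state in US_STATES else None

extract_us_state({"place": {"country_code": "US", "full_name": "Knoxville, TN"}})  # -> 'TN'
```

Places tagged at the country level (full_name "United States") fall through the suffix check and are dropped, matching the requirement that a tweet resolve to one of the 50 states or Washington, D.C.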
Figure 14: The monthly number of geo-tagged tweets in 50 states and Washington D.C. in the United States.
A.4 Twitter User Analysis
We further analyzed the users in our dataset to demonstrate that the tweets were crowdsourced from the broad public rather than from a few heavy posters. Figure 15 shows the proportion of users versus the number of tweets they posted. Only 0.055% of users tweeted more than one geo-tagged tweet per day on average, generating 11,844 tweets (1.82% of all tweets) in our dataset. In fact, 96.71% of users had no more than ten records in our dataset.

Figure 15: Scatter plot of the number of tweets per user, on a log-log scale.
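The heavy-user statistics above can be computed from the per-tweet user IDs with a sketch like the following (the function name and thresholds-as-parameters are our illustration; 107 days corresponds to Jan. 25 to May 10):

```python
from collections import Counter

def user_activity_stats(tweet_user_ids, max_daily_avg=1.0, days=107):
    """Return (fraction of users averaging more than `max_daily_avg` geo-tagged
    tweets per day, fraction of all tweets those heavy users contribute)."""
    per_user = Counter(tweet_user_ids)
    heavy = {u for u, c in per_user.items() if c / days > max_daily_avg}
    heavy_tweets = sum(per_user[u] for u in heavy)
    return len(heavy) / len(per_user), heavy_tweets / len(tweet_user_ids)

# Toy example: one user posts 214 tweets (2/day over 107 days), two post rarely.
frac_users, frac_tweets = user_activity_stats(
    ["u1"] * 214 + ["u2"] * 3 + ["u3"], days=107)
```

On the paper's dataset, the same computation yields the reported 0.055% of users contributing 1.82% of tweets.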
References

[1] E. J. Mundell and Robin Foster. Coronavirus Outbreak in America Is Coming: CDC, 2020.
[2] The White House. Proclamation on Declaring a National Emergency Concerning the Novel Coronavirus Disease (COVID-19) Outbreak, 2020.
[3] OpenStreetMap. Nominatim: a search engine for OpenStreetMap data, 2018.
[4] Wikipedia. U.S. state and local government response to the COVID-19 pandemic, 2020.
[5] Sarah Mervosh, Jasmine C. Lee, Lazaro Gamio, and Nadja Popovich. Coronavirus Outbreak in America Is Coming: CDC, 2020.
[6] David M. Blei, Andrew Y. Ng, and Michael I. Jordan. Latent Dirichlet allocation. Journal of Machine Learning Research, 3(Jan):993–1022, 2003.
[7] Michael Röder, Andreas Both, and Alexander Hinneburg. Exploring the space of topic coherence measures. In Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, pages 399–408. ACM, 2015.
[8] Steven Loria, P. Keen, M. Honnibal, R. Yankovsky, D. Karesh, E. Dempsey, et al. TextBlob: simplified text processing. Secondary TextBlob: Simplified Text Processing, 2014.
[9] Dictionary.com. What does the face with tears of joy emoji mean, 2017.
[10] Yunhe Feng, Zheng Lu, Zhonghua Zheng, Peng Sun, Wenjun Zhou, Ran Huang, and Qing Cao. Chasing total solar eclipses on Twitter: Big social data analytics for once-in-a-lifetime events. In , pages 1–6. IEEE, 2019.
[11] Emily Chen, Kristina Lerman, and Emilio Ferrara. COVID-19: The first public coronavirus Twitter dataset. arXiv preprint arXiv:2003.07372, 2020.
[12] Nikola Ljubešić and Darja Fišer. A global analysis of emoji usage. In