[PDF] A Machine Learning Approach to Predicting Continuous Tie Strengths

Abstract

Relationships between people constantly evolve, altering interpersonal behavior and defining social groups. Relationships between nodes in social networks can be represented by a tie strength, often empirically assessed using surveys. While this is effective for taking static snapshots of relationships, such methods are difficult to scale to dynamic networks. In this paper, we propose a system that allows for the continuous approximation of relationships as they evolve over time. We evaluate this system using the NetSense study, which provides comprehensive communication records of students at the University of Notre Dame over the course of four years. These records are complemented by semesterly ego network surveys, which provide discrete samples over time of each participant's true social tie strength with others. We develop a pair of powerful machine learning models (complemented by a suite of baselines extracted from past works) that learn from these surveys to interpret the communications records as signals. These signals represent dynamic tie strengths, accurately recording the evolution of relationships between the individuals in our social networks. With these evolving tie values, we are able to make several empirically derived observations which we compare to past works.

Full PDF

AA M

ACHINE L EARNING A PPROACH TO P REDICTING C ONTINUOUS T IE S TRENGTHS

A P

REPRINT

James Flamino † Department of PhysicsRensselaer Polytechnic InstituteTroy, NY 12180 [email protected]

Ross DeVito † Department of Computer Science and EngineeringUniversity of California San DiegoLa Jolla, CA 92093 [email protected]

Boleslaw K. Szymanski

Department of Computer ScienceRensselaer Polytechnic InstituteTroy, NY 12180 [email protected]

Omar Lizardo

Department of SociologyUniversity of California Los AngelesLos Angeles, CA 90095 [email protected]

January 26, 2021 A BSTRACT

Relationships between people constantly evolve, altering interpersonal behavior and deﬁning socialgroups. Relationships between nodes in social networks can be represented by a tie strength,often empirically assessed using surveys. While this is effective for taking static snapshots ofrelationships, such methods are difﬁcult to scale to dynamic networks. In this paper, we proposea system that allows for the continuous approximation of relationships as they evolve over time.We evaluate this system using the NetSense study, which provides comprehensive communicationrecords of students at the University of Notre Dame over the course of four years. These recordsare complemented by semesterly ego network surveys, which provide discrete samples over timeof each participant’s true social tie strength with others. We develop a pair of powerful machinelearning models (complemented by a suite of baselines extracted from past works) that learn fromthese surveys to interpret the communications records as signals. These signals represent dynamic tiestrengths, accurately recording the evolution of relationships between the individuals in our socialnetworks. With these evolving tie values, we are able to make several empirically derived observationswhich we compare to past works.

Introduction

Relationships and the interactions that characterize them are a deﬁning features of social networks [1–3]. In the networkand social sciences, the strength of these relationships are often represented by a “tie strength,” a weighted edge betweentwo nodes that marks the existence of a connection between the people portrayed by the nodes. Previously, work onunderstanding tie strength has ranged from interpreting its importance in information spread [4–8] to using a variety ofsocial features to predict the magnitude of tie strength between individuals [9–13].A question fundamental to this topic is: what contributes to the strength of a tie between two people? Or, morespeciﬁcally, what attributes of a relationship can we use to predict a tie strength value that properly represents thecloseness of two individuals within a social network? This question has no singular answer, though there have beenpopular works delving into possible interpretations [4,5,7]. Such works have pointed to both qualitative and quantitative † Authors contributed to this work equally a r X i v : . [ c s . S I] J a n PREPRINT - J

ANUARY

26, 2021attributes of relationships that seem to inﬂuence the strength of a the relationship between two individuals, and thereforewould contribute to the evolution of tie weights within the involved social network.In Granovetter’s popular early work on this topic [4] these factors were identiﬁed as time invested, emotional intensity,mutual conﬁding, and reciprocal services. He suggested tie strength was ultimately a linear combination of thesefactors. Krackhart’s response to this work [5] introduced an alternative characterization of tie strength that consisted ofinteraction frequency, affection, and time, which he deﬁned qualitatively as an enduring history between the two linkedindividuals.Marsden’s work provided further clariﬁcation to the considered factors, introducing predictors (aspects of relationshipsthat are related to, but not a part of, tie strength), and indicators (actual components of tie strength). The former setof factors contain relationship descriptors like kinship and educational differences. The latter set of factors containedattributes of communication and shared interests, and intimacy that are more commonly seen as features of tie strengthin other works. In particular, Marsden addressed closeness (emotional intensity), duration of connection, frequencyof communication, breadth of discussion topics, and mutual conﬁding (all of which correspond to Granovetter’scharacterizations of tie strength) and found that closeness played an important part in informing tie strength.Given the subjective nature of relationships and their attributes like intimacy and affection, a more robust and gen-eralizable quantitative approach to prediction faces some challenges, though there has been groundwork laid to thisend [9–13]. In some of these works, tie strength is approximated by linking features like communication frequency,social media friend overlap, shared attributes (like gender or education), directed message keyword usage, and the liketo predict closeness between two individuals. The predictions are then usually compared to a ground truth extractedfrom a survey asking participants to rate their closeness either on a numbered scale or indirectly by using questions like“How strong is your relationship with this person?”.Facebook has become a prevalent medium for these kinds of experiments. Since among the online social mediaplatforms, Facebook maintains a massive social network and facilitates broad forms of interaction between users. Theresults of these experiments revealed that features like days since the last communication, participant’s number offriends, and exchanged intimacy words contribute a fair amount to the prediction of tie strength. In addition to this,some of the work showed that public communications, like Facebook wall posts, and private communications, likeprivate Facebook direct messages, often contribute equally to predict tie strength.Despite the interesting implications of these works, the scope of these systems are always limited, restricted to asnapshot in time of the social network. But as most people’s lives are constantly a witness to, relationships evolve overtime. They can be subject to changes, and such changes will have a direct impact on how people interact with former,current, and future friends. In fact, the progression of a relationship is important for characterizing the connection’sstrength. All of the past works mentioned above only focus on predicting tie strength at a single moment in time.Additionally, these models often faced the issue of being tied to their speciﬁc application. The representations of tiestrength were often characterized by attributes extracted from a singular platform, namely Facebook. This results ininterpretations of tie strength that are deﬁned by their speciﬁc platforms, making them incapable of being appliedgenerally.In this paper, we lay out a generalizable system that addresses these concerns and demonstrates that evolving tiestrengths for a dynamic social network can be accurately predicted given only a practically small survey-based groundtruth. There are three core pieces to our system: the input data, the training data, and the model that learns to interpretthe input data as tie strengths using the training data.A person’s digital communication records are used as the input data, from which a trained model can predict their socialties. While communication is just one of the many hypothesized aspects of a relationship which impacts the tie strength,as a data source, digital communications have the practical beneﬁts of being abundant, multifaceted, and easy to collect.These advantages grow as the world becomes increasingly dependent on digital interactions. Our system can workwith any number of communications mediums simultaneously (e.g. text messages, phone calls, video calls, WhatsApp,and Facebook Messenger), it just requires that the records include the time, type, and pair of people involved for eachcommunication event.To train a model and evaluate its performance on converting input data into tie strength values, ground truth data onsocial ties is needed for some subset of those for whom we also have communication records. To meet this need, oursystem just requires a small number of top k lists of social ties. These lists can be procured at any time over the courseof the dataset, as long as there is concurrent communication data related to the person who’s top social ties make up thelist. Any representation of tie strength ground truth in practice would likely be from survey responses. This being thecase, a ranking-based ground truth would be more robust when compared to more common exact social tie values orbinary relationship labels. Using this ordering beneﬁts from never having to ask for explicit social tie values or cutoffs,which is important as these concepts would be highly subjective among survey respondees. Furthermore, we show that2 PREPRINT - J

ANUARY

26, 2021even asking for an explicit ranking in the survey is not required to avoid response bias. Instead we derive our top k rankings using survey questions based on Granovetter’s and Krackhart’s work.We found pairwise comparison based machine learning models to be excellent predictors in comparison to the baselines.These models are able to take advantage of using top k orderings and pairwise comparisons to provide many trainingexamples for their underlying models using a realistic amount of survey based ground truth data. This is importantas machine learning performance tends to rise with an increase in quality training data. This pairwise comparisonframework has the additional beneﬁt of producing an interpretable social tie value.We train these machine learning models using the top k lists, and show that once trained, we can use these models tocontinuously, and accurately, predict the evolving tie strength of one person towards another using just communicationdata. In the following sections we discuss our system in greater detail, evaluating its efﬁcacy at accurately producingevolving tie strength values. We then show that analyzing these dynamic values for all participants reveals interestingobservations related to communication evolution, relationship stability, and triadic dynamics. Data

To develop and evaluate our dynamic tie strength model, we needed data on a social network, and data on relationshipattributes that could be used to predict tie strengths within that social network. Optimally, both would extend over along enough period of time to capture changing relationship dynamics. The NetSense [14] study, which consist ofdata voluntarily collected from randomly selected students entering the the University of Notre Dame, ﬁts this need,providing linked ego network surveys and digital communications records. Data for this study was collected from Fall2011 and Spring 2013. Student phone records, including text messages and phone calls, were provided, along withcorresponding ego network surveys that were ﬁled out each semester by the participants. In terms of scope, NetSensefollowed 196 students at its peak, yielding extensive communication records that we could use to capture relationshipchanges over time.

Communication Records

NetSense’s communication record conforms to the standard Call Detail Record (CDR) format, listing a timestamp,sender, receiver, message type, and message length. The NetSense study contained , , events generated by theparticipants of the study. Text messages make up about of these events, with the remainder being calls. Despitethis imbalance in volume, phone calls remain an important medium for communication and carry an emotional weight,especially among the younger population captured in the study [15]. Thus, we choose to include both calls and textmessages. In fact, we ﬁnd that considering calls and text separately also improve the machine learning models weimplement. Ego Network Surveys

As mentioned earlier, ego network surveys were collected once a semester to complement the communication record.These surveys were prefaced with a question asking the survey-taker (the ego) to list individuals (the alters) withwhom they spend a signiﬁcant amount of time communicating or interacting. This list could include up to 20 alters,and could include people that were not involved with the study. This allowed for these lists to contain a variety ofrelationship types, including fellow students, roommates, parents, siblings, coworkers, and romantic partners. Theego was subsequently asked to specify their relationships with these individuals. This classiﬁcation was provided tothe ego as a closed list. In general, options available ranged in familiarity from “signiﬁcant other” and “parent” to“acquaintance”. Other related information on these alters was also collected through additional follow-up questionsthat included asking about the history of contact, shared interests and activities, and the frequency of communication.Importantly, the surveys also asked the ego to subjectively rate similarity and closeness with the alters.Despite the thoroughness of this study, the time between survey postings is signiﬁcant: the data has four ego networksurveys over the four semesters, and the study participants listed on average . people per survey. Regardless of thesparsity of these ego network surveys, our results demonstrate that they still provide sufﬁcient ground truth support forour models. Deﬁnition of Tie Strength

Using Granovetter’s and Krackhart’s tie strength deﬁnitions, we can outline a template for evaluating the connectionbetween an ego and their alters. In early works on this subject, tie strength was represented discretely as labels (suchas “close” or “not close”). More recently, tie strength has been encoded in the form of a numerical range, which3

PREPRINT - J

ANUARY

26, 2021conforms to Granovetter’s belief that tie strength is continuous, not discrete. These representations were varied, andwere often based on a combination of some large set of qualitative and quantitative predictors. However, these methodsfor interpreting tie strength are also usually reliant on platform-speciﬁc attributes [9, 10].To avoid these limitations, we take a different approach to encoding tie strength by simply ordering the list of alters fromour ego network surveys. For any given ego network survey, we produce a ranked list of the individuals listed in thesurvey, where the order is determined by how strong the survey-taker’s social tie is with each individual. Subsequently,each survey-taker produces a set of top k social tie rankings, timestamped by their respective ego network surveys(greater details of how this is done are presented in the methods section). We train and evaluate our social tie predictionmodels through how well their predicted tie strengths conform with these rankings at their corresponding times.Speciﬁcally, we introduce a suite of models that interpret our communication data streams as dyadic tie strengths. Thesetie strengths are represented by a signal value, which is used to establish a predicted ranking by ordering said signals bymagnitude. We compare this ordering with the corresponding survey’s ground truth top k social tie ranking.Given that these rankings are determined by how close an alter is to their ego, a model that is properly trained to producesignals that accurately reconstruct the appropriate ranking of each alter for any associated ego ultimately means that themodel is capable of generating continuous signals that are a representation of evolving tie strength between an ego andthose they’ve communicated with. In other words, a signal’s magnitude that indicates the level of afﬁnity an individualhas for another in the context of a ranked list ﬁts within the deﬁnition of evolving tie strength, which in the past has beendeﬁned loosely. And since the model is designed to generate signals over long periods of time (as tuned by the multiplesocial tie rankings over time in the training data), any new target individual with simple communication data should beable to have the model produce tie strengths that (when ordered by magnitude) are be able to effectively identify theircloser social ties and subsequently capture the evolution of their connections with the new target individual across thecourse of their communication data. Models

Our suite of models for this survey reconstruction process can be divided into two classes: a baseline class and amachine learning class. For the baseline class, we implement single-attribute models that use speciﬁc attributes ofcommunication behavior that are often cited as aspects of tie strength or directly used as a proxy for it [4, 10–13]. Thesecond class, the focus of this paper, uses machine learning methods on time series or a collection of single-attributemodel values to predict all out tie strengths for a target person whose communications record are given. These modelsdo this by making pairwise comparisons between everyone the target person had communicated with.

Baseline Models

It has long been postulated “the more frequently persons interact with one another, the stronger their sentiments offriendship for one another are apt to be” [16]. Previous research has often used the frequency of communication topredict tie strength or emotional closeness [10–12]. Following this established methodology, we created a frequencymodel that calculates frequency by dividing the number of communication events between two individuals by theelapsed time since they ﬁrst communicated at the timestep for which it is being evaluated. In addition, we assesseda recency model that uses the elapsed time since the last communication between two people as an inversely relatedestimate of frequency of contact, as is done in [11]. This measure of time since last communication was found to be themost predictive single feature in [10].Stronger ties, by deﬁnition, tend to involve longer time commitments [4]. Following this logic, we also created aduration model that uses the time since the ﬁrst communication record as a proxy for length of friendship or other socialbond. This was found to be the second most predictive feature in [10]. As another proxy for a pair’s time commitmentto communicating, we add the volume model, which counts the total number of calls and text messages between twoindividuals.Recently, there has been work showing that tie strength can also be predicted using the overlap of friend groups betweentwo speciﬁc individuals [17]. In one implementation of this concept [13], a metric called “weighted overlap” forsocial bow tie structures is used as a feature to help machine learning algorithms predict the tie strength between twoindividuals in a speciﬁc time frame. Given that this particular feature contributes heavily to the predictive performancein a couple of the tested cases in this work, we implement this metric here as well as a model to explore the predictivecapabilities of evolving friend group overlap. The speciﬁc implementation is shown in detail in the methods section.Given that this is the baseline class, we also ensure to set the lowest bar for survey reconstruction with a simple randombaseline. This randomly sorts the individuals with whom the target participant had communicated previously into anarbitrary ranking. 4

PREPRINT - J

ANUARY

26, 2021

Machine Learning Models

At their core, our machine learning models compare a selected individual’s (person A) communication history with oneindividual (person B) against A’s communication history with another (person C). Provided these two histories, themachine learning models then will predict, between B and C, which of the two will have a greater tie strength withA. When these comparisons are made for all pairs of people in the selected individual’s records, we can generate thepredicted ranked list for that person (see Methods for more details), and subsequently produce meaningful tie strengthvalues for all of this person’s relationships.These tie strengths for the machine learning models are expressed as winning percentages . For an individual beingevaluated, winning percentage is the fraction of pairwise comparisons with all other people the target had communicatedwith were the model predicts the evaluated individual has a stronger social tie. This score has the range [0 , wherehigher the score means the more likely the scored individual is closer to the selected individual. This tie strength valuecan also be generated at any point in time provided the models are trained and there is communication history availablefor those being considered in the pairwise comparisons.Pairwise comparison-based ranking models can also take advantage of a ground truth in ranked form. Speciﬁcally,selecting permutations from this ordering allows for the generation of many training examples from relatively littlesurveying. This is important as the quality of machine learning models is tied to the quantity and quality of trainingsamples.The ﬁrst machine learning model uses an ensemble method that utilizes features of duration, recency, frequency, andvolume in the communication data to inform a random forest classiﬁer [18]. The random forest classiﬁer predicts whichof the two compared connections with a selected individual is indicative of a greater tie strength, and thus should behigher ranked. The second machine learning model uses communications time series and recurrent neural networks,speciﬁcally a two-channel Long Short Term Memory (LSTM) networks [19]. The LSTM is used to make pairwisecomparisons, which are in turn used to produce a signal and ranking, all of which is done in the same manner as in theEnsemble model. As mentioned earlier, we ﬁnd that performance improves for both machine learning models whentexts and calls are treated as separate data streams. Results

Ranking Metrics

To determine survey reconstruction error and evaluate the models’ capabilities, we compare a model’s predicted rankedindividuals against the ground truth using the rank-biased overlap (RBO) [20]. RBO is an indeﬁnite rank similaritymeasure used to evaluate the similarity of two ranked incomplete lists, making it better suited for this task than, forexample, Jaccard. RBO has several other desirable attributes for this kind of comparison; it handles items being presentonly in one ranking, weights higher ranking items more, works with any given ranking length, and requires little to noassumptions about the data.

Ranking Performance

The evaluation process we chose for comparing our suite of models against the NetSense ground truth was the standard3-fold cross validation. Given the total list of NetSense egos, the validation process shufﬂed and equally split this listof participants into three mutually exclusive groups. For each fold, a test group was selected, with the remaining twogroups used as training data. Within a fold, we separate the training and testing data into four subsets, split equally bysemesterly survey time (the time at which the surveys were ﬁlled out by the egos). At each survey time, the trainingand testing subset only includes the surveys and communication data from before that time (therefore the later subsetscontain the earlier subsets). Then for each subset we trained and tested the machine learning models with all trainingdata available. We then subsequently tested these trained models on the available testing data in the subset, using themodels to predict the current testing ego’s surveys. This process was repeated for each subset, allowing more trainingand testing data to be released with each proceeding survey time. We do not, however, allow the models to train onthe preceding ground truth of the test data after it’s been predicted. Instead, we ensured the models have predicted thesurveys for the test egos for each survey time ﬁrst within the current fold before releasing ground truth for comparison.This setup prevents any model from using future communication history in any of the folds during evaluation. Once thissetup was completed for each model within the current fold, the fold score was determined by ﬁnding the weightedaverage survey reconstruction accuracy (RBO) across all available surveys for each individual test participant. Theweight of each predicted survey is the size of that survey’s ground truth, accentuating the prediction of larger surveys.5

PREPRINT - J

ANUARY

26, 2021Table 1: Results of the NetSense survey reconstruction models, with variance in parentheses

Model Class Model RBO

Baseline Random 0.037 (0.003)Overlap 0.064 (0.008)Duration 0.234 (0.033)Recency 0.307 (0.025)Frequency 0.320 (0.032)Volume 0.363 (0.032)Machine Learning Ensemble 0.450 (0.029)LSTM 0.481 (0.029)Given a score for every participant in this fold, this score was then averaged over all test participants. The ﬁnal scorewas computed as the average fold score over all folds, which are shown for all models in Table 1.Of the baseline class models, overall volume of calls and texts since the start of college was the most predictive. Thiswas followed by frequency and recency of communication. Duration of communication was a relatively distant fourth,but this may be tied to the time frame of data to which we had access to. Speciﬁcally, those with whom a studyparticipant had been friends with long before college could only have an estimated duration of friendship spanning backto the start of college. This start of college period was also a time when participants were making many new contacts.Some of these contacts would go on to become friendships, but many were just freshmen meeting new people whowould not be signiﬁcant as their time in college went on. For this reason, this may have been a poor estimate of thelength of friendship and therefore of the social bond.The limited scope of the data negatively affected the overlap model as well, which performed even worse than theduration model. The limit in accuracy here is most likely from the fact that the communication data only provides thecomprehensive communities of neighboring friends for study participants. Non-participants do not have their out-goingmessages recorded, so their only neighbors will always be strictly participants. The overlap model requires a sizablesample of the overlapping and non-overlapping neighbors of both individuals being evaluated for tie strength prediction.As one of these two individuals might be a non-participant, their neighborhood will be incomplete which skews theoverlap value. These issues demonstrate some of the difﬁculty of inferring social ties given just a single target person’srecords.But despite these limitations, the Ensemble and LSTM models produced RBO scores of . and . respectivelyfor the NetSense study. While frequency, recency, duration, and volume of communication have merit for approximatingrelationship strength on their own [10, 11, 16], they are unable to capture a greater whole of the latent mechanics ofsocial dynamics. But when all used in conjunction, this feature space allows for even a simple random forest classiﬁerto perform well. However, the relative performance of the Ensemble model hints at the weaknesses of such simplemodels for inferring complex social dynamics, and that even more improvements can be made. This is where the ﬁnalmodel, the LSTM comes in.Recurrent neural networks can accept, as input, temporal sequences of an arbitrary length. This allows them to useas features the whole communications histories from the start of the dataset to any time at which social ties are beingevaluated. This ability to analyze histories of interactions in a temporally aware way is likely the key to achieving thebest performance. Instead of using heuristic features calculated at a speciﬁc time, the LSTM is able to internally learnlatent features from patterns of communication over time that are most meaningful for evaluating the strength of socialties. Evolving Tie Strength Analysis

Evaluating Continuous Signals

As mentioned in the Data section, the signals that can be generated by the trained models are used to represent evolvingtie strengths due to the fact that the ground truth rankings that the models are ﬁt to are ordered by closeness. Hence, thesignal magnitude between two individuals is also the magnitude of tie strength between them. Now, to illustrate thesignal generation dynamics of our top performing models, we present the evolution of tie strengths between a NetSensestudy participant and a sample of their listed alters as encoded by our Volume model, the Ensemble model, and theLSTM. For consistency, we normalize all values. In this particular analysis we train the machine learning models usingall the NetSense data, excluding all data relating to the selected participant. We sequentially sampled the resultant signalvalues of our two trained machine learning models and the Volume model over the entire duration of the NetSense study,6

PREPRINT - J

ANUARY

26, 2021Figure 1: Generated signals for a randomly selected NetSense participant using the top three survey reconstructionmodels (Volume, Ensemble, and LSTM). The x-axis marks the timestamp, and the y-axis marks the signal magnitude.The colored vertical bars indicate the occurrence of a survey, denoting if the models correctly (or incorrectly) classiﬁedthe ranking of the considered individual.which includes the time of each survey where the ground truth values are known. We plot these sampled values againsttime, marking the times of the surveys and denoting if the signals, when ordered with all other alters in ascending orderby value, place each alter at the right survey rank when compared against ground truth. These signals, as generatedby the top four models for a selected participant, are shown in Figure 1. We speciﬁcally selected a subset of listedindividuals that had particular relationships with the target individual (e.g. parents, siblings, signiﬁcant others, andclose friends).The ﬁrst observation that we must be made in Figure 1 is the differences in signal shapes between model types. Naturallythis is due to the differing signal generation methods of each model. Speciﬁcally, the Volume model captures thecontinuously growing intensity of communication while the machine learning models convert their pairwise predictionsinto signals using comparative probability (i.e. a strong signal for an individual means they have a higher probability ofbeing closer to the selected participant than other individuals).But despite the differences in tie strength interpretations between models, there are still clear trends that are reﬂectedacross models. For example, in Figure 1, “Romantic Partner 1” becomes socially involved with our selected participant7

PREPRINT - J

ANUARY

26, 2021right around the time of the last survey. This is universally reﬂected by a massive spike in predicted tie strength, whichis maintained for the remainder of the study. The consistent communication between the selected participant and theirfamily (the sibling and parent) is also easily shown across the models, with accompanying high tie strength values.The universal visibility of these relationship transitions can be attributed to an accompanying spike in communicationactivity. Since the Volume model operates directly using total event occurrences, obvious shifts in communicationbehavior are easily captured in the outputted signals. However, only the machine learning models are able to capturemore nuanced trends visibly.One example can be found in the listed individual “Friend 4”. Around the 3rd semester, this friend and participant becamestrong friends, enough to warrant the participant placing the friend on their ego network survey. But communication issparse overall as shown by the Volume model, meaning a change in calls and text volume was not the biggest changehere. An underlying shift in the pattern of communication occurred in a way that only the machine learning models wereable to detect it, and subsequently boost the tie strength value between the two. We can use “Acquaintance 1” as anotherexample. This listed individual initially meets the participant around the ﬁrst semester, most likely through a studygroup. Beyond the ﬁrst semester the participant forms other more concrete and signiﬁcant social circles, relegatingthe acquaintance to a strictly academic role (which is why communication with them persists past the ﬁrst semesterthough they are no longer included within the surveys). This social tie weakens as class overlap inevitably diverges, andthe two eventually move on with their lives. While these initial anecdotes are interesting, analyzing the tie strengthdynamics of larger groups could allow us to draw stronger general conclusions.For the remainder of our analysis, we will be using the LSTM model with its winning percentage representation of tiestrength. In addition to being the best performing models in our suite, its comparative winning percentage representationof dynamic tie strength is meaningful and easy to understand. For this analysis, we sampled predicted tie strengths fromthe LSTM model across the duration of the NetSense study. We can represent the inferred strength of a social tie fromone person (who’s communications records are being used for the inference) to another as a directed edge betweennodes. The weight for each edge is the tie strength value itself, which varies as a function of time. With the resultingdynamic social network, we can analyze general tie strength trends and compare them with previous work to verifythe efﬁcacy of the model in capturing important relationships trends using our generalizable methodology of surveyreconstruction.To establish the groundwork for future research, we deliberately chose two broad subjects for initial analysis: dynamicsof relationships and dynamics of triadic groups. The analysis of tie strength evolution as it relates to different relationshiptypes is important, as the different kinds of classiﬁcations (i.e. friend, sibling, parent, etc) can often help determinethe trends of closeness between individuals in the past and future. We also choose to analyze triadic motifs within ourevolving network due to the importance of triads in distinguishing communities [21] and network structure as a whole.

Relationship Dynamics

As stated in the Data section, participants were asked to classify their relationships with those they listed in their egonetwork surveys. Given these classiﬁcations, we can analyze how signals on average change with time for differentnodes depending on their relationship type. In particular, we analyze the average edge weight over time for friends, kin,and signiﬁcant others (as identiﬁed by survey-takers), to evaluate relationship stability over time.In Figure 2a we show the average edge weight over time for our selected relationship types. We found that while friendstend to have a more volatile edge weight (due to the fact that there are so many different kinds of friends), parents(and similar kin) tend to stay fairly consistent with high tie strength. This agrees with previous research [22], whichshows that college students very often maintain consistent contact with their families. For those that have frequentcommunication with their parents and siblings, this behavior is correlated with their closeness to them, as reﬂected insurvey rankings.The average tie strength value is also very stable towards parents across time, with little variation (which can be even beseen in Figure 1), indicating that the rank held by parents rarely wavers between participants, conﬁrming the generalstrength of connection. This is likely due to the fact that if parents are even going to make it on to a ranked list ofindividuals with whom the participant has communicated with the most, these relationships are going to inherentlystable. If they were not, students would not invest time communicating with them enough to warrant listing them.Surprisingly, signiﬁcant others are often times ranked below parents. This difference in ranking is due to the fact thatthe label of romantic partner is not static, like the label of “parent.” Signiﬁcant others can be introduced prior to thestudy or anywhere during the study, and that label can be removed or changed at any time as well. This effect is seen toa greater degree with the term friend. In addition to the conditional assignments of the term, there are many differentstages of rapport that any friend could be while being included in a participant’s survey listing. It is also important to8

PREPRINT - J

ANUARY

26, 2021Figure 2: A sequence of illustrations for the analysis of the evolving tie values in NetSense as generated by one of ourbest survey reconstruction models. ( a ) shows the growth of the average edge weight of different relationships overtime. ( b ) shows the Kernel Density Estimation of transition differences between relationships. ( c ) shows the absolutedifference in edge weight between gendered majorities and minorities in detected triads in our social networks. ( d )shows the number of detected triads that ﬁt into one of the three standard triadic motifs.note that despite the volatility (or lack thereof) of these classiﬁcations, the growth of tie strengths inevitably slows astime goes on. This is reﬂected in Figure 1, as most signals settle after either a transition point or a period of growth/loss.This indicates in the absence of a perturbing force (e.g. getting to know a person, experiencing a breakup), relationshipstend to cement themselves to some standard level closeness, resulting in some consistent pattern of communication. Asimilar observation was made in Saramaki’s work [23]. In this paper, social signatures between individuals in a socialnetwork were derived directly from communication data. These social signatures were found to be generally stable andconsistent in shape, corroborating the trend of stability seen in tie strength signals here.We can further characterize our analysis of relationship interactions by analyzing the transition difference in signalweight over time. We do this by ﬁnding the greatest change in signal value (the “transition point”), then taking thedifference in the average signal before and after this transition point. We ﬁnd the transition difference for everylisted individual of the study participants, with the resultant values binned by relationship label. The Kernel DensityEstimation (Gaussian kernel) of the binned transition differences for Signiﬁcant Others, Parents, and Friends is seen inFigure 2b.We ﬁnd that for our data, most family-related relationships remain stable with a primary mode centered about , and avery slight mode about . , indicating there is usually either little change or deviation from whatever the initial signalwas, or if there was change, it was positive. Signiﬁcant others in NetSense had a more noticeable positive trend, with aprimary mode at . and a secondary mode at , with a tail that trails off into the negative region. This shape marks thedynamic of close (but non-familial) relationships, which undergo positive changes when the relationship forms, andnegative changes when a break-up occurs. And since this kind of relationship is a lot more volatile than a family-relatedrelationship, this type of dynamic occurs more often, and is therefore reﬂected more by our models. The distributions oftransition differences for friendships, like signiﬁcant others, has a wider mode at , with a smaller mode at . and a tailinto the negative domain. This shape can be attributed to the varying types of friendships that can be classiﬁed under the“friends” relationship class in the ego network surveys. An example the variety of friends and their associated signalsfor this can easily be seen in Figure 1 as well, where the signal shapes for the friend classiﬁcations differ visibly, asopposed to the signals of the kin classiﬁcations. Importantly, these signiﬁcant stability differences between friendshipsand kin relationships have been observed before [24]. In this work, kin relations were found to be more stable andmaintain a higher level of emotional closeness with little maintenance, compared to friendships which were less stableand required active maintenance to prevent decay. Our analysis further conﬁrms these observations with the clearquantiﬁcation of the difference in tie strength stability in friends versus family. Triadic Dynamics

To analyze the dynamics of triadic motifs within our network, we ﬁrst extract a set of triads that consistently occur inthe communication data across each semester. With these mappings we then analyzed the tie strengths for the in-degreeand out-degree edges between each of the nodes involved. With the mappings and associated the tie values obtained, wechoose to ﬁrst focus on the evolution differences between genders within triadic groups. In particular, we looked intomixed gender triads, analyzing groups where there were two males and one female, or two females and one male. Wetook the absolute difference between the average degree value of the majority gender and the average degree value ofthe edges from the majority gender to the minority gender. This difference characterizes how the majority treats the9

PREPRINT - J

ANUARY

26, 2021minority and evaluates the interconnected relationships. The differences across times for both triad types is presentedin Figure 2c. As seen in the plot, majorities do often interact with minorities differently, though male majorities doso to a greater degree than female majorities, though not by a great amount. The interactivity difference betweena triadic majority and minority can be viewed as two nodes uniting against one (either directly or indirectly). Thistwo-against-one behavior is fairly common in sociology [25], and happens to varying degrees. But how prevalent is thismotif?To answer this question, we can use our evolving tie strength values to observe the growth of this motif, and compare itto its counterparts: the weak link triad and the equalist triad. In the weak link scenario, two of the three nodes are highlyconnected; however, there is one link (in both directions) in the triad that is weak compared to the others. In terms ofsocial group formation, this would mean that while two members are good friends with the third member (and viceversa), the two members themselves are not as strong friends with each other. Alternatively, in the equalist scenario, alllinks in the triad are fairly equal in value. At every timestep in our dynamic network, we tally the number of triads thatmeet one of the three criteria to track the trends of the triadic dynamics. These trends are shown in Figure 2d. We ﬁndthat while often times equalist triads is the most prevalent, there is a consistently growing trend of two-against-one triadsthat become more established as time evolves. This is reﬂected in Figure 2c, as the absolute difference in edge weightbetween majority and minority grows. Overall, this indicates that the two-against-one motif is a prevalent dynamic intriads as tie strengths settle. Essentially, in a triadic dynamic, after friendships begin to cement, there will often be a“third wheel”.

Discussion

Tie strengths play an important role in the analysis of social networks, characterizing the relationship between individualsand providing insight into how those involved will interact with each other [4–6]. While past works have delvedinto predicting tie strength [9, 10], there has been limited research into the forecasting and subsequent analysis of tiestrength that evolve over long periods of time. The paucity of this kind of research is in part due to the difﬁculty incollecting data social ties as they change, which is typically done through surveys. Additionally, many past works tendto implement tie strength measures that depend on many platform-speciﬁc attributes. In this paper we address bothproblems by introducing a system that converts easily collected communication data into continuous tie strength valueswith machine learning. We design this system to be generalizable, depending only on the communication data and asparse number of ego network surveys. Using a small set of modular questions, we extract social tie rankings fromthe surveys that we use to train our predictive models and predict the social tie rankings. The trained models can alsoconvert communication data to continuous signals over time. Given the nature of these signals that are generated by ourmodels to predict survey rankings, we can interpret these values as continually evolving tie strengths. And with thesevalues, we can analyze the relationship dynamics of social networks.The NetSense study provided long term real-world communication data and surveys that provided ground truth valuesat points in time. Using machine learning, we are able to reconstruct ranked versions of these surveys with a relativelyhigh average RBO. Provided the resultant continuous tie strength values from the best-performing models, we areable to effectively track the evolution of relationships (like identifying the time at which a signiﬁcant other enters aparticipant’s social circle). Furthermore, we show that relationships with parents (and other close family, like siblings)remain fairly consistent over time, with signiﬁcant others coming in second in terms of tie strength stability. By furtheranalyzing tie strengths about signal transition points we show that while parent tie strengths tend to be very strong andstable, often going unchanged over time, while signiﬁcant others are more likely to experience signiﬁcant transitions(driven by the initial formation of the relationship, or a subsequent dissolution). We also establish the paradigm thatwithout a perturbing force, most relationships reach some form of resting-state as tie strengths settle.We further our analysis by looking into triadic dynamics of participants. We ﬁnd that in mixed triads, male majoritiestended to treat female minorities differently, while female majorities did the same to male minorities to a slightly lesserdegree. This behavior reﬂects the two-against-one triad motif. We delve into this observation further by comparingthis behavior against two other triad motifs (weak link and equality), and observe the growth of the three in our socialnetworks over time, discovering that the two-against-one dynamic increases as relationships cement themselves. Insummary, our novel system for predicting continuous tie strength values using general, platform-agnostic communicationdata establishes an innovative paradigm for studying the transitions and trends of interpersonal connections as theyevolve in dynamic social networks. Moving forward, this paper will act as a foundation for our continued analysis intothe evolution of relationships, and how they characterize past, present, and future interactions within social networks.10

PREPRINT - J

ANUARY

26, 2021

Methods

Establishing Social Tie Rankings from Ego Network Surveys

When implementing a tie strength measure, our foremost interest is choosing a system that allows for generalizabilityand customization, yet also produces a measure that is capable representing a nuanced spectrum of relationshipstrength. We additionally want to avoid a reliance on platform-speciﬁc attributes. While Marsden’s work indicatesthat closeness is the best predictor of tie strength over other factors (speciﬁcally frequency of communication andduration of relationship) [6], we choose to avoid making any particular assumptions about predictor importance as well.Therefore, we introduce tie strength as an ranked list of individuals, where the ordering determines the depth of therelationship between a listed individual and the the participant that took the ego network survey.In the ego network surveys the wording of the starting question that prompts a survey-taker to list individuals withwhom they’ve communicated is void of any instructions on the ordering of said list. Thus, we cannot rely on the orderof the raw list to be consistently indicative of a survey-taker’s preference on any of the listed people. Given this, weassume there is none initially and instead craft our own using the follow-up survey questions and the staples of tiestrength characterization as a guide. The answers to the follow-up questions are mostly selected from a set of answersthat indicate a range of magnitude. For example, when asking after a survey-takers’s perceived closeness with a listedindividual they can choose "Especially close", "Close", "Less than close", or "Distant". Most questions follow this form,though there are a few questions like "How long in years have you known this person?" that have open inputs that cantake any rational number. We utilized four inputs from our data as guided by the previously established deﬁnitions fortie strength: Closeness (how close the survey-taker is to one of the listed people), duration (how long the survey-takerhas known the person), frequency (how often does the survey-taker communicate with the person), and similarity(subjectively, how similar does the survey-taker think themselves to be to the person).To determine an ordering from these mixed inputs without assuming the importance of any one input over another, weuse a pairwise tournament selection process. Consider a ego network survey taken by one participant. Every individuallisted by the participant is compared against all of the other listed individual on a question. An individual that has agreater value in the question than a counterpart is awarded a point. If two listed individuals have the same value, bothare awarded a point. These points are aggregated across all the questions and then a ranked list is created by orderingeveryone by their score in descending order. If there is a tie in aggregate score, the inputs with rational numbers areused to break the tie (e.g. for two tied individuals, the one who has been known for longer ultimately wins). After alltournaments are complete for that ego network survey, we are left with our top k social ties ranking for the survey-taker,where the orderings indicate the importance of the listed individuals, as determined by the questions in the ego networksurvey. We repeat this process for all participants for all surveys throughout the study’s timeline. Ultimately, thesimplicity of this system ensures there are no assumptions made about the weighting of the questions. Additionally, thissystem is not dependent on a static set of features, since questions can be removed, replaced, or added and this won’tchange the architecture of how tie strength is generated in the end. The Bow Tie Overlap Model

Since overlapping (and non-overlapping) friend groups have shown to be a powerful tool in understanding tie strengthsbetween people [13, 17], implementing some measure of this kind of overlap is important to test its applicability withinour datasets. Therefore, we introduce the weighted overlap metric from Mattie’s work in bow tie frameworks [13] giventhat (as mentioned earlier) this feature was highly informative for a couple of their tie strength prediction machinelearning models. Weighted overlap is deﬁned as below for two individuals i and j : (cid:101) o ij = (cid:80) k ∈ n ij ( w ik + w jk ) s i + s j − w ij (1)Where n ij is the shared friends between i and j . That is, the overlap in the K = 1 neighbors of i and j . We interpret theweights w ij here to be the total number of events between some i and j before the time of the survey being evaluatedfor reconstruction. And s i ( s j ) is the total number of events generated by i ( j ) before the time of the survey. Therefore,if all the individuals that have communicated with i have also communicated with j and vice versa, then (cid:101) o ij = 1 . Andso for some target individual i , we iteratively consider every communicated with individual as j and sample each (cid:101) o ij before the considered survey time. We then rank by value of (cid:101) o ij to predict the ground truth survey ordering.11 PREPRINT - J

ANUARY

26, 2021

Machine Learning Models

The primary models of this paper are our machine learning models. The machine learning models consider onetarget person at a time and make pairwise comparisons between the people with whom the target person has anycommunications history. Speciﬁcally, we consider the Ensemble model (which makes these pairwise comparisons witha random forest classiﬁer), and the LSTM model (a two-channel long short-term memory recurrent neural network).For both models we use a method of ranking called Borda count [26] for pairwise comparisons to generate the predictedranked lists that we compare against the ground truth. This method is commonly viewed as “an information-theoreticallyoptimal procedure” for recovering the top k ranked items based on noisy comparisons that emphasizes simplicity,optimality, and robustness with regards to the underlying pairwise-comparison probability generation.Given collection of n people whom the target person has interacted indexed by the set [ n ] ≡ { , ..., n } , we create amatrix M of dimensions n × n where M ij is the probability of i having a greater social tie with the target person than j as determined by the random forest or LSTM using i ’s and j ’s communication history with the target person up to thetime of consideration. The diagonal of M , where i = j , is set to a probability of . Now to ﬁnd the Borda score, wemust keep track of wins and losses in the pairwise tournament in M . To do this, we transform M into M (cid:48) using Eq. 2. M (cid:48) ij =  M ij > M ij = − M ij < (2)The Borda count itself for i ∈ [ n ] , which is used to form the actual ranking, is calculated using Eq. 3. B i = n (cid:88) j =1 M (cid:48) ij (3)We then ﬁnd B i for all i ∈ [ n ] and then order by magnitude. This becomes the current predicted ranking that wecompare against ground truth. Now, to generate the signals for Figure 1 and our network analysis we can convert thecount to the winning percentage with Eq. 4. In the equation, w i , l i , and t i are the number of head to head wins, losses,and ties for i . We generate the winning percentage incrementally across the entire NetSense study, and the resultanttime series is then used as the dynamic edge weights between an ego and those they’ve communicated with.WinningPercentage i = B i + ( n − n −

1) = w i + 0 . · t i w i + l i + t i (4) Ensemble Model

The Ensemble model uses a random forest classiﬁer to generate the pairwise comparison probabilities in M . Theclassiﬁers for the best performing Ensemble model used 100 weak classiﬁers. These classiﬁers are trained usinga speciﬁc feature vector that is used to predict which individual will the target person have a greater social tie to.Consider the features for i ∈ [ n ] (denoted as f i ) as the four baseline class features computed for just calls and justtexts. These features are frequency, recency, duration, and volume as described in Models section. Integrating theBow Tie Overlap attribute signiﬁcantly brings down overall performance, and so was excluded in the ﬁnal Ensemblemodel. We take the difference of these two feature vectors as the given feature vector for the classiﬁers, deﬁned asDifferenceFeatureVector ( x, y ) = f x − f y given x, y ∈ [ n ] . LSTM Model

For our LSTM models, f i is a two-channel time series. The two channels are the histories of calls and texts, both binnedinto 21 days intervals. The feature vector used by the LSTM is the time series for the two individuals being comparedstacked on each other, resulting in a four channel time series that spans through time at which the social tie is beingevaluated to the ﬁrst interaction between the target person and either person in the comparison. References [1] Stephen P Borgatti, Ajay Mehra, Daniel J Brass, and Giuseppe Labianca. Network analysis in the social sciences. science , 323(5916):892–895, 2009. 12

PREPRINT - J

ANUARY

26, 2021[2] James A Kitts, Eric Quintane, and ESMT Berlin. Rethinking social networks in the era of computational socialscience, 2020.[3] Mark T Rivera, Sara B Soderstrom, and Brian Uzzi. Dynamics of dyads in social networks: Assortative, relational,and proximity mechanisms. annual Review of Sociology , 36:91–115, 2010.[4] Mark S Granovetter. The strength of weak ties. In

Social networks , pages 347–367. Elsevier, 1977.[5] David Krackhardt, N Nohria, and B Eccles. The strength of strong ties.

Networks in the knowledge economy , 82,2003.[6] Peter V Marsden and Karen E Campbell. Measuring tie strength.

Social forces , 63(2):482–501, 1984.[7] Verónica Policarpo. What is a friend? an exploratory typology of the meanings of friendship.

Social Sciences ,4(1):171–191, 2015.[8] J-P Onnela, Jari Saramäki, Jorkki Hyvönen, György Szabó, David Lazer, Kimmo Kaski, János Kertész, and A-LBarabási. Structure and tie strengths in mobile communication networks.

Proceedings of the national academy ofsciences , 104(18):7332–7336, 2007.[9] Jason J Jones, Jaime E Settle, Robert M Bond, Christopher J Fariss, Cameron Marlow, and James H Fowler.Inferring tie strength from online directed behavior.

PloS one , 8(1), 2013.[10] Eric Gilbert and Karrie Karahalios. Predicting tie strength with social media. In

Proceedings of the SIGCHIconference on human factors in computing systems , pages 211–220, 2009.[11] M. Conti, A. Passarella, and F. Pezzoni. A model for the generation of social network graphs. In , pages 1–6, 2011.[12] Jason Wiese, Jun-Ki Min, Jason I Hong, and John Zimmerman. "you never call, you never write" call and smslogs do not always indicate tie strength. In

Proceedings of the 18th ACM conference on computer supportedcooperative work & social computing , pages 765–774, 2015.[13] Heather Mattie, Kenth Engø-Monsen, Rich Ling, and Jukka-Pekka Onnela. Understanding tie strength in socialnetworks using a local “bow tie” framework.

Scientiﬁc reports , 8(1):1–9, 2018.[14] Rachael Purta, Stephen Mattingly, Lixing Song, Omar Lizardo, David Hachen, Christian Poellabauer, and AaronStriegel. Experiences measuring sleep and physical activity patterns across a large college cohort with ﬁtbits. In

Proceedings of the 2016 ACM international symposium on wearable computers , pages 28–35, 2016.[15] Bethany L Blair, Anne C Fletcher, and Erin R Gaskin. Cell phone decision making: Adolescents’ perceptions ofhow and why they make the choice to text or call.

Youth & Society , 47(3):395–411, 2015.[16] George Homans.

The Human Group , page 133. Harcourt, Brace & World, 1950.[17] Elizabeth Bott and Elizabeth Bott Spillius.

Family and social network: Roles, norms and external relationships inordinary urban families . Routledge, 2014.[18] Leo Breiman. Random forests.

Machine learning , 45(1):5–32, 2001.[19] Sepp Hochreiter and Jürgen Schmidhuber. Long short-term memory.

Neural computation , 9(8):1735–1780, 1997.[20] William Webber, Alistair Moffat, and Justin Zobel. A similarity measure for indeﬁnite rankings.

ACM Transactionson Information Systems (TOIS) , 28(4):1–38, 2010.[21] Comandur Seshadhri, Tamara G Kolda, and Ali Pinar. Community structure and scale-free collections oferd˝os-rényi graphs.

Physical Review E , 85(5):056109, 2012.[22] Yi-Fan Chen and James E Katz. Extending family to school life: College students’ use of the mobile phone.

International Journal of Human-Computer Studies , 67(2):179–191, 2009.[23] Jari Saramäki, Elizabeth A Leicht, Eduardo López, Sam GB Roberts, Felix Reed-Tsochas, and Robin IM Dunbar.Persistence of social signatures in human communication.

Proceedings of the National Academy of Sciences ,111(3):942–947, 2014.[24] Sam GB Roberts and Robin IM Dunbar. The costs of family and friends: an 18-month longitudinal study ofrelationship maintenance and decay.

Evolution and Human Behavior , 32(3):186–197, 2011.[25] Theodore Caplow.

Two against one: Coalitions in triads.

Prentice-Hall, 1968.[26] Nihar B Shah and Martin J Wainwright. Simple, robust and optimal ranking from pairwise comparisons.

TheJournal of Machine Learning Research , 18(1):7246–7283, 2017.13

PREPRINT - J

ANUARY

26, 2021

Acknowledgements

This work was sponsored in part by DARPA under contract W911NF-17-C-0099, the Army Research Ofﬁce (ARO)under contract W911NF-17-C-0099, and the Ofﬁce of Naval Research (ONR) under grant N00014-15-1-2640. Theviews and conclusions contained in this document are those of the authors and should not be interpreted as representingthe ofﬁcial policies either expressed or implied of the U.S. Government.