Characterization and comparison of large directed graphs through the spectra of the magnetic Laplacian

Abstract

In this paper we investigate the use of the magnetic Laplacian to characterize directed graphs (a.k.a. networks). Several results are obtained, including the finding that community structure is related to rotational symmetry in the spectral measurements for a type of stochastic block model. Owing to the Hermiticity of the magnetic Laplacian, we show how to scale our approach to larger networks containing hundreds of thousands of nodes using the Kernel Polynomial Method (KPM). We also propose combining the KPM with the Wasserstein metric in order to measure distances between networks even when these networks are directed, large and of different sizes, a hard problem which cannot be tackled by previous methods presented in the literature. In addition, our Python package is publicly available at github.com/stdogpkg/emate. The code runs on both CPU and GPU and can estimate the spectral density and related trace functions, such as the entropy and the Estrada index, even in directed or undirected networks with millions of nodes.


Bruno Messias F. de Resende a) and Luciano da F. Costa
Physics Institute of São Carlos, University of São Paulo, São Carlos, SP 13566-590, Brazil
(Dated: 20 July 2020)


The Laplacian operator of a directed network is not Hermitian. This property hampers the interpretation of the spectral measurements and restricts the use of computational methods developed in network science. In this work, we propose a framework and novel measures based on the spectrum of the magnetic Laplacian to study directed networks. By using the properties of circulant matrices, we show analytically that the novel measurements are able to capture information about the structure of directed networks. In particular, the number of modular structures in a network is related to the rotational symmetry of the spectrum, which can therefore contribute to characterizing the parameters of directed networks. To infer the generative parameters of networks, we propose applying the Wasserstein metric to measure the distance between the spectra of the magnetic Laplacian, allowing networks to be compared. All the proposed methods depend on the diagonalization of the magnetic Laplacian operator, which implies a high computational cost, so that the calculations can become unfeasible. To overcome this limitation, we implemented the Kernel Polynomial Method (KPM) using the TensorFlow package. This method approximates the spectral density of Hermitian matrices at a lower computational cost, allowing the spectral characterization of large directed networks containing hundreds of thousands of nodes.

a) Electronic mail: [email protected]

I. INTRODUCTION

In the seminal work "Can one hear the shape of a drum?", Mark Kac discusses the relationship between a membrane and the set of eigenvalues (spectrum) of the Laplacian operator. However, this relationship was found not to be unique, in the sense that two distinct membranes (non-isometric manifolds) can have the same spectrum. Nevertheless, despite such degeneracies, spectral information can provide valuable insights about the real world. For instance, spectral geometry has been used to study physical phenomena such as quantum gravity and has provided the basis for developing algorithms in computer science.

Although the analysis of continuous regions such as those considered by Kac remains an interesting issue, several phenomena in nature and society need to be modeled in terms of discrete structures such as networks. In this case, we can adapt Kac's question as "Can one hear the shape of a network?" The answer to this question is analogous to what has been verified for the original question, i.e., two nonisomorphic networks can share the same spectrum. Despite such a limitation, the spectral approach to discrete structures can still be useful in some practical and theoretical problems. An example of a spectral approach that has been applied to characterize networks is the von Neumann entropy.

More recently, the concept of entropy of a graph has been used to measure the similarity between two given networks. Examples of this approach include the entropic similarity applied to the inference of parameters of network models. However, this measure cannot be immediately extended to directed networks and, as has been shown previously, directed edges have substantial implications for dynamics on graphs.

In addition, the entropic similarity depends on the product of the matrices associated with the given networks. This implies that this similarity measurement is not invariant with respect to permutations of the indices associated with the nodes. Consequently, such measurements are not well defined when the nodes cannot be associated with fixed indices.

Given that directed networks can accurately model several real-world problems, it is essential to develop new methodologies capable of dealing with network directionality. An immediate difficulty implied by this scenario is that the associated Laplacian operator will often have complex eigenvalues, because the adjacency matrix associated with a directed network is non-symmetric. A promising approach to address this problem consists of studying directed complex networks through their magnetic Laplacian operator. As an example, it has been shown that the results of community detection algorithms can be improved by considering the magnetic Laplacian associated with the directed network.

In this work, we show that the magnetic Laplacian approach can be used to characterize complex networks, including those with hundreds of thousands of nodes. By characterization, we mean that measurements taken from this operator contribute to identifying the network model responsible for generating a given network, as well as to inferring the parameters responsible for generating a specific network configuration. Several results were obtained. First, for simpler models (i.e., modular regular networks), the number of modular structures is related to the rotational symmetry of the specific heat. Subsequently, we show that these spectral measurements, combined with the Wasserstein distance between spectral densities, can provide valuable contributions to inferring the original parameters used to generate those networks, with relative errors smaller than 1%.

II. METHODS

A. Magnetic Laplacian

A directed network can be expressed by a tuple $G = (V, E, w)$, where $V$ is the set of vertices and $|V|$ stands for the number of vertices; $E$ is the set of edges such that, for each $u, v \in V$, the ordered tuple $e = (u, v) \in E$ denotes a directed edge from vertex $u$ to $v$; and $w : E \to \mathbb{R}$ assigns a weight to each edge. A directed network can be associated with an undirected counterpart $G^{(s)} = (V, E^{(s)}, w^{(s)})$, where $w^{(s)}(u, v) = \frac{w(u,v) + w(v,u)}{2}$. However, the directionality of $G$ is lost in $G^{(s)}$.

In order to preserve both the Hermiticity and the information about directionality, define $\gamma : E \to \mathcal{G}$, where $\mathcal{G}$ is a group, such that $\gamma(u, v)^{-1} = \gamma(v, u)$. Choosing $\mathcal{G} = U(1)$, $\gamma$ can be expressed as

\[ \gamma_q(u, v) = \exp\left(2\pi i q f(u, v)\right), \tag{1} \]

where $q \in [0, 1]$ and $f(u, v) = w(u, v) - w(v, u)$ represents the flow in a given vertex $u$ due to another vertex $v$. The symmetric network equipped with $\gamma_q$ carries information about the directed edges and, at the same time, its adjacency matrix is Hermitian.

Now, we consider the following operator, associated with $(G^{(s)}, \gamma_q)$, where $\odot$ is the Hadamard product:

\[ \mathbf{L}_q = \mathbf{D} - \mathbf{\Gamma}_q \odot \mathbf{W}^{(s)}, \tag{2} \]

where $\mathbf{D}$ is the degree matrix, which contains the node degrees along its main diagonal; $[\mathbf{\Gamma}_q]_{u,v} = [\mathbf{\Gamma}_q]_{v,u}^{\dagger} = \gamma_q(u, v)$ and $[\mathbf{W}^{(s)}]_{u,v} = [\mathbf{W}^{(s)}]_{v,u} = w^{(s)}(u, v)$.

It is interesting to observe that this operator corresponds to the magnetic Laplacian, $\mathbf{L}_q$. The reason for the term magnetic is that the operator can be used to describe the phenomenology of a quantum particle subject to the action of a magnetic field. Due to this physical context, the parameter $q$ is named charge.

By construction, $\mathbf{D}$ and $\mathbf{W}^{(s)}$ are both symmetric and $\mathbf{\Gamma}_q$ is Hermitian. Consequently, $\mathbf{L}_q$ is Hermitian. In addition, it is sometimes convenient to use a normalized version of $\mathbf{L}_q$, which is given by

\[ \mathbf{H}_q = \sqrt{\mathbf{D}^{-1}}\, \mathbf{L}_q \sqrt{\mathbf{D}^{-1}}, \tag{3} \]

where $\mathbf{H}_q$ is defined only if the network is at least weakly connected.

A given eigenvector of $\mathbf{H}_q$, $|\psi_{l,q}\rangle \in \mathbb{C}^{|V|}$, can be obtained as a solution of

\[ \mathbf{H}_q |\psi_{l,q}\rangle = \lambda_{l,q} |\psi_{l,q}\rangle, \tag{4} \]

where $\lambda_{l,q} \in \mathbb{R}$ and $\lambda_{1,q} \leq \dots \leq \lambda_{l,q} \leq \dots \leq \lambda_{|V|,q}$.

It is possible to enhance the analogy with physical systems by including a temperature parameter $T \in \mathbb{R}^{+}$. By using this parameter, the network properties can be studied from the statistical mechanics viewpoint. Here, we adopt the Boltzmann-Gibbs formulation of statistical mechanics as a means to associate the partition function

\[ Z(T, q) = \sum_{l=1}^{|V|} e^{-\lambda_{l,q}/T} \tag{5} \]

with $G$. By using Eq. (5), the expected value at temperature $T$ of an operator $O$ can be expressed in terms of its eigenvalues $\{o_l\}$ as

\[ \langle O \rangle = \frac{1}{Z(T, q)} \sum_{l=1}^{|V|} e^{-\lambda_{l,q}/T}\, o_l. \tag{6} \]

In this work, we use Eq. (6) to define the specific heat, $c_\lambda$, associated with a network. This novel measurement is given by

\[ c_\lambda(q, T) = \frac{\langle H_q^2 \rangle - \langle H_q \rangle^2}{T^2}. \tag{7} \]

FIG. 1. Panels (a), (b) and (c) show an SF, an ER and a BA network, respectively; the color map encodes the in-degree, $k_{in}$, of each node. Panels (d), (e) and (f) show the specific heat in terms of the charge $2\pi q$ (polar coordinate) and temperature (radial coordinate) for a Bollobás et al. scale-free network, an ER network and a BA network, respectively. The parameters used to generate those networks were $|V| = 1000$; the edge probability for ER was $p = 0.$ ; $m = 3$. The temperature and the charge are uniformly sampled from the intervals $[0. , .15]$ and $[0, /2]$ with 30 points each. As can be noted, $c_\lambda$ shows a specific pattern for each network. This fingerprint pattern explains why the SOM (Self-Organizing Map) was so successful in the task of organizing networks belonging to the same classes into the same groups using only the specific heat, without any knowledge about those classes. It follows from Eq. (1) that the eigenvalues, and therefore $c_\lambda$, are symmetric with respect to the addition of integer values to the charge, $\gamma_q = \gamma_{q+j}\ \forall j \in \mathbb{Z}$, which is reflected in the bilateral symmetry with respect to the horizontal axis in (d), (e) and (f).

Eq. (7) has two free parameters, namely $q$ and $T$. Because of this free choice of parameters and, owing to the fact that we have a rotation associated with the directed edges ($\gamma_q$), we plot $c_\lambda$ in two dimensions, setting $2\pi q$ as the polar coordinate and $T$ as the radial one. Regarding the interpretation of physics-related quantities, the specific heat is directly related to the variance of the eigenvalue spectrum. As a consequence, this quantity provides a signature of the spectrum properties, contributing to the characterization of the network structure.

B. Directed modular networks

In this work, we resort to a type of directed stochastic block model in order to obtain good control of network properties such as community size, and also because of its potential for facilitating analytical studies. The adopted stochastic block model networks were obtained as follows:

1. Split the set $V$ into $N_f$ equal-size sets $(f_1, f_2, \ldots, f_{N_f})$.
2. For each $u, v \in f_i$, create a directed edge $(u, v)$ with probability $p_c$.
3. For each $u \in f_i$ and $v \in f_{i+1}$ (assuming $f_{N_f+1} = f_1$), create a directed edge $(u, v)$ with probability $p_d$.

C. Spectral entropy of directed networks

Recent works reported how to use entropic measurements to quantify the similarity between two undirected networks. The entropy of a network is derived from the usual Laplacian spectrum (all eigenvalues are real). By contrast, these measurements cannot be used in the case of directed networks because the adjacency matrix is not Hermitian. However, the magnetic Laplacian methodology yields a Hermitian operator $\mathbf{H}_q$, which is used here to define an entropic measurement for directed networks.

Recall that a quantum system at finite temperature, $T$, is defined by its respective density matrix, $\rho(T)$. For a network $G$ and charge $q$, this operator can be expressed in terms of the eigenvalues and eigenvectors associated with $\mathbf{H}_q$ as

\[ \rho_q(T) = \frac{1}{Z(T, q)} \sum_{l=1}^{|V|} e^{-\lambda_{l,q}/T}\, |\psi_{l,q}\rangle\langle\psi_{l,q}|. \tag{8} \]

This density matrix can be used to define measurements associated with a directed (or undirected) network. For instance, the concept of spectral entropy of a network can be extended to the directed case by using the following equation:

\[ S(G, q, T) = \mathrm{Tr}\left[\rho_q(T)\, \mathrm{Log}\, \rho_q(T)\right], \tag{9} \]

where Log is the matrix logarithm and Tr corresponds to the trace operation.

Given the definition of spectral entropy, we can extend the entropic dissimilarity between two directed networks, $\tilde{G}$ and $G$, as

\[ S_d(\tilde{G}, G, q, T) = S(\tilde{G}, q, T) - \mathrm{Tr}\left[\tilde{\rho}_q(T)\, \mathrm{Log}\, \rho_q(T)\right]. \tag{10} \]

FIG. 2. The networks in (a) and (b) are isomorphic in the sense that they can be mapped one into the other by changing the indexes 0 → , → .

However, as can be noted, the term within the trace depends on the product of distinct matrices. Thus, even though the two networks presented in Fig. 2 are isomorphic, the entropic dissimilarity between them is nonzero, whereas it should be null. Another issue related to the entropic similarity approach is that the measure cannot be used to compare networks with different numbers of nodes. This limitation matters also because the largest weakly connected component generated by a model does not necessarily have the same size as the overall number of nodes. At the same time, such a measure has a high computational cost.

In this work, we suggest applying the kernel polynomial method jointly with the Wasserstein metric in order to quantify the dissimilarity between directed networks. It should be emphasized that this combination of approaches is only possible because the magnetic Laplacian is a Hermitian operator.
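The definitions of Secs. II A to II C can be sketched numerically as follows. This is a minimal dense NumPy illustration, not the authors' eMaTe implementation: the function names (`directed_sbm`, `magnetic_laplacian`, `boltzmann_weights`, `specific_heat`, `trace_rho_log_rho`) and the example parameters are assumptions made here, the node degree is taken as the row sum of $\mathbf{W}^{(s)}$, and the spectrum is obtained by exact diagonalization, which is only viable for small networks.

```python
import numpy as np

def directed_sbm(n_blocks, block_size, p_c, p_d, rng):
    """Directed block model of Sec. II B (blocks on a ring, n_blocks >= 3)."""
    n = n_blocks * block_size
    block = np.arange(n) // block_size
    same = block[:, None] == block[None, :]                  # u, v in f_i
    nxt = (block[:, None] + 1) % n_blocks == block[None, :]  # u in f_i, v in f_{i+1}
    prob = p_c * same + p_d * nxt
    w = (rng.random((n, n)) < prob).astype(float)
    np.fill_diagonal(w, 0.0)                                 # no self-loops
    return w

def magnetic_laplacian(w, q, normalized=True):
    """L_q of Eq. (2) or, if normalized, H_q of Eq. (3)."""
    w_s = (w + w.T) / 2.0                        # symmetrized weights w^(s)
    gamma = np.exp(2j * np.pi * q * (w - w.T))   # Eq. (1) with f = w(u,v) - w(v,u)
    deg = w_s.sum(axis=1)                        # assumed degree definition
    lap = np.diag(deg) - gamma * w_s
    if not normalized:
        return lap
    if np.any(deg == 0):
        raise ValueError("H_q is undefined for isolated nodes")
    scale = 1.0 / np.sqrt(deg)
    return scale[:, None] * lap * scale[None, :]

def boltzmann_weights(eigvals, temperature):
    """Normalized weights exp(-lambda/T)/Z; the shift avoids overflow."""
    p = np.exp(-(eigvals - eigvals.min()) / temperature)
    return p / p.sum()

def specific_heat(eigvals, temperature):
    """Eq. (7): eigenvalue variance under the Boltzmann weights over T^2."""
    p = boltzmann_weights(eigvals, temperature)
    mean = np.sum(p * eigvals)
    return (np.sum(p * eigvals**2) - mean**2) / temperature**2

def trace_rho_log_rho(eigvals, temperature):
    """Tr[rho Log rho] as printed in Eq. (9); the conventional
    von Neumann entropy is the negative of this value."""
    p = boltzmann_weights(eigvals, temperature)
    p = p[p > 0]
    return np.sum(p * np.log(p))

rng = np.random.default_rng(0)
w = directed_sbm(n_blocks=3, block_size=20, p_c=0.3, p_d=0.5, rng=rng)
h = magnetic_laplacian(w, q=1 / 3)
lam = np.linalg.eigvalsh(h)          # real eigenvalues, since H_q is Hermitian
c_heat = specific_heat(lam, temperature=0.1)
s_val = trace_rho_log_rho(lam, temperature=0.1)
```

For unit weights the flow $f(u, v)$ is an integer, so the spectrum computed this way is unchanged under $q \to q + 1$, in line with the symmetry noted in the caption of Fig. 1.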

D. Comparison of large directed networks: The KPM method and the Wasserstein metric

In order to compute the spectral distance between two networks it is necessary to compute the spectral density,

\[ \rho_q(\lambda) = \frac{1}{|V|} \sum_{l=1}^{|V|} \delta(\lambda - \lambda_{l,q}), \tag{11} \]

which has complexity order $O(|V|^3)$. As such, this approach becomes unfeasible for larger networks ($|V| >$ ). Fortunately, the magnetic Laplacian matrix is Hermitian and often sparse, so that the kernel polynomial method (KPM) can be considered for estimating $\rho_q$.

The objective of the KPM is to calculate a point in the simplex $\{\mathbf{p} \in \mathbb{R}^n_+ : \sum_{i=1}^{n} p_i = 1\}$ which defines a discrete measure

\[ \alpha_q = \sum_{i=1}^{n} p_{i,q}\, \delta_{\lambda_{i,q}} \tag{12} \]

that approximates Eq. (11) with enough accuracy.

This method is based on two approximations. The first is that any continuous real function on the interval $[-1, 1]$ can be expanded in terms of Chebyshev polynomials, allowing the spectral density to be approximated by $n$ terms. The second approximation consists in evaluating the traces associated with the terms of that expansion using Hutchinson's approach. In essence, the trace of a function of a sparse matrix can be approximated by averaging the products of this function with a set of random vectors. The oscillations induced by these approximations can be smoothed by subsequently applying a known kernel, which in the present work corresponds to the Jackson kernel. Therefore, the KPM allows the spectrum of the magnetic Laplacian to be estimated by using algorithms with near-linear computational cost. In this way, it becomes possible to estimate the spectral density and measurements such as entropy and specific heat even in the case of very large networks containing millions of nodes.

Given that it is possible to estimate the magnetic Laplacian spectral density effectively, we can employ the Wasserstein metric in order to define distances between directed networks. Let the set of admissible couplings of two probability distributions $\alpha_q$ and $\tilde{\alpha}_q$ be given as

\[ U(\alpha, \tilde{\alpha}) = \left\{ \mathbf{U} \in \mathbb{R}_+^{|V| \times |\tilde{V}|} : \mathbf{U}\mathbf{1}_{|\tilde{V}|} = \mathbf{p},\ \mathbf{U}^{T}\mathbf{1}_{|V|} = \tilde{\mathbf{p}} \right\}. \tag{13} \]

For $d \geq 1$, the Wasserstein distance is

\[ W_d(p, \tilde{p}, q) = \left( \min_{\mathbf{U} \in U(\alpha, \tilde{\alpha})} \left[ \sum_{i,j} |\lambda_{i,q} - \tilde{\lambda}_{j,q}|^{d}\, U_{i,j} \right] \right)^{1/d}. \tag{14} \]

This function has several desirable characteristics: it is a metric, it can be applied to networks with different numbers of nodes, and it has relaxed implementations that allow the distance value to be obtained at a smaller computational cost.

The task of estimating the value of a parameter used to generate a given network, such as the connecting probability in the ER model, can be approached by seeking the minimum Wasserstein distance between the original network and a set of $n_{exp}$ networks synthesized by considering several parameters.
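The two ingredients described above, a KPM estimate of the spectral density and a Wasserstein distance between discrete measures, can be sketched as follows. This is an illustrative dense NumPy version under stated assumptions, not the TensorFlow-based eMaTe code: the names `jackson_kernel`, `kpm_measure` and `wasserstein_1d` are hypothetical, the default spectral bounds $[0, 2]$ assume a normalized magnetic Laplacian, the probes are random $\pm 1$ vectors, and the distance is the $d = 1$ case of Eq. (14) computed from cumulative distributions. The test matrix has a prescribed spectrum so that the estimate can be compared with the exact measure.

```python
import numpy as np

def jackson_kernel(n_moments):
    """Jackson damping factors g_0, ..., g_{n_moments - 1}."""
    k = np.arange(n_moments)
    n = n_moments + 1
    return ((n - k) * np.cos(np.pi * k / n)
            + np.sin(np.pi * k / n) / np.tan(np.pi / n)) / n

def kpm_measure(h, n_moments=40, n_vectors=25, n_points=200,
                bounds=(0.0, 2.0), rng=None):
    """Support and weights of a discrete measure, as in Eq. (12),
    approximating the spectral density of the Hermitian matrix h."""
    rng = np.random.default_rng() if rng is None else rng
    n = h.shape[0]
    # Rescale so that the spectrum lies strictly inside [-1, 1].
    scale = 1.01 * (bounds[1] - bounds[0]) / 2.0
    shift = (bounds[1] + bounds[0]) / 2.0
    h_tilde = (h - shift * np.eye(n)) / scale
    # Hutchinson probes with random +-1 entries, one column per probe.
    v0 = rng.choice([-1.0, 1.0], size=(n, n_vectors)).astype(complex)
    t_prev, t_curr = v0, h_tilde @ v0
    mu = np.empty(n_moments)
    mu[0] = 1.0
    mu[1] = np.vdot(v0, t_curr).real / (n * n_vectors)
    for k in range(2, n_moments):
        # Chebyshev recurrence T_k = 2 x T_{k-1} - T_{k-2} applied to the probes.
        t_prev, t_curr = t_curr, 2.0 * (h_tilde @ t_curr) - t_prev
        mu[k] = np.vdot(v0, t_curr).real / (n * n_vectors)
    # Damped series on Chebyshev nodes; with Gauss-Chebyshev quadrature the
    # weight of each node is the series value divided by n_points.
    theta = np.pi * (np.arange(n_points) + 0.5) / n_points
    coeff = jackson_kernel(n_moments) * mu
    coeff[1:] *= 2.0
    weights = np.cos(np.outer(theta, np.arange(n_moments))) @ coeff / n_points
    weights = np.clip(weights, 0.0, None)   # guard against stochastic noise
    return scale * np.cos(theta) + shift, weights / weights.sum()

def wasserstein_1d(x, p, y, r):
    """1-Wasserstein distance between sum_i p_i delta_{x_i} and
    sum_j r_j delta_{y_j}, as the integral of |CDF_x - CDF_y|."""
    x, p, y, r = (np.asarray(a, dtype=float) for a in (x, p, y, r))
    ix, iy = np.argsort(x), np.argsort(y)
    grid = np.sort(np.concatenate([x, y]))
    cdf_x = np.concatenate([[0.0], np.cumsum(p[ix])])
    cdf_y = np.concatenate([[0.0], np.cumsum(r[iy])])
    fx = cdf_x[np.searchsorted(x[ix], grid[:-1], side="right")]
    fy = cdf_y[np.searchsorted(y[iy], grid[:-1], side="right")]
    return float(np.sum(np.abs(fx - fy) * np.diff(grid)))

rng = np.random.default_rng(0)
n = 300
lam_a = np.linspace(0.2, 1.8, n)             # spectrum of the test matrix
lam_b = np.linspace(0.2, 1.0, n)             # a different reference spectrum
g = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
unitary, _ = np.linalg.qr(g)
h = (unitary * lam_a) @ unitary.conj().T     # Hermitian with eigenvalues lam_a
support, weights = kpm_measure(h, rng=rng)
uniform = np.full(n, 1.0 / n)
d_same = wasserstein_1d(support, weights, lam_a, uniform)
d_other = wasserstein_1d(support, weights, lam_b, uniform)
```

Only matrix-vector products with the matrix are needed, which is why a sparse representation would give the near-linear cost mentioned above; the dense product is used here for brevity.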
In this work, we chose a set of $n_q$ charges from which the magnetic Laplacians of each candidate network are obtained; the KPM is then used to obtain the respective spectra, and the minimal distance between the latter and the original is determined by using the mean Wasserstein distance

\[ \langle W_d \rangle(p) = \frac{1}{n_{exp}} \sum_{p \in P} \left( \frac{1}{n_q} \sum_{q \in Q} W_d(p, \tilde{p}, q) \right). \tag{15} \]

III. RESULTS

A. Community structures in networks and spectral symmetries

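As a first step towards characterizing directed complex networks by using the magnetic Laplacian formalism, we derive some analytic and numerical results relating network structure and the spectrum of the magnetic Laplacian operator.

First, we study the influence of community structure in directed networks on the magnetic Laplacian spectrum and, consequently, on the specific heat, $c_\lambda$. We assume that the connections within the communities, $\mathbf{W}_{in}$, as well as between the communities, $\mathbf{W}_{out}$, are the same for all structures. Under this hypothesis, the adjacency matrix can be organized as follows, assuming $N_f$ communities (henceforth, we take $N_f > 2$):

\[ \mathbf{W} = \begin{bmatrix} \mathbf{W}_{in} & \mathbf{W}_{out} & \mathbf{0}_{N_c} & \dots & \mathbf{0}_{N_c} \\ \mathbf{0}_{N_c} & \mathbf{W}_{in} & \mathbf{W}_{out} & \dots & \mathbf{0}_{N_c} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \mathbf{W}_{out} & \mathbf{0}_{N_c} & \mathbf{0}_{N_c} & \dots & \mathbf{W}_{in} \end{bmatrix}, \tag{16} \]

where $\mathbf{0}_{N_c}$ is the $N_c \times N_c$ null matrix. For the sake of generality, $\mathbf{W}_{in}$ and $\mathbf{W}_{out}$ can be constructed in arbitrary form.

The magnetic Laplacian of this network has the following organization:

\[ \mathbf{H}_q = \begin{bmatrix} \mathbf{H}_{in} & \mathbf{H}_{out} & \mathbf{0}_{N_c} & \dots & \mathbf{H}_{out}^{\dagger} \\ \mathbf{H}_{out}^{\dagger} & \mathbf{H}_{in} & \mathbf{H}_{out} & \dots & \mathbf{0}_{N_c} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ \mathbf{H}_{out} & \mathbf{0}_{N_c} & \mathbf{0}_{N_c} & \dots & \mathbf{H}_{in} \end{bmatrix}. \tag{17} \]

Note that this matrix is block circulant, i.e.,

\[ \mathbf{H}_q = \begin{bmatrix} \mathbf{h}_0 & \mathbf{h}_1 & \dots & \mathbf{h}_{N_f-1} \\ \mathbf{h}_{N_f-1} & \mathbf{h}_0 & \dots & \mathbf{h}_{N_f-2} \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{h}_1 & \mathbf{h}_2 & \dots & \mathbf{h}_0 \end{bmatrix}. \tag{18} \]

Observe that $\mathbf{H}_q$ is a specific case of a Toeplitz matrix, so that the eigenvalues can be obtained by using the property that all the columns of the original matrix can be expressed as cyclic permutations of the first column. Our objective now is to find the set $\{\lambda_u\}$ such that

\[ \mathbf{H}_q |\psi_u\rangle = \lambda_u |\psi_u\rangle. \tag{19} \]

As known from the literature, the eigenvectors of a circulant matrix can be obtained as

\[ |\psi_u\rangle = \begin{bmatrix} |\varphi\rangle \\ \rho_u |\varphi\rangle \\ \vdots \\ \rho_u^{N_f-1} |\varphi\rangle \end{bmatrix}, \tag{20} \]

where $u \in \{0, \ldots, N_f - 1\}$ and $\rho_u = \rho^{\star}_{N_f-u} = \exp\left(\frac{2\pi i u}{N_f}\right)$. Substituting the eigenvector of Eq. (20) into Eq. (19) allows the block equation induced by the first row to be solved as

\[ \tilde{\mathbf{H}}_u |\varphi\rangle = \sum_{l=0}^{N_f-1} \mathbf{h}_l\, \rho_u^{l}\, |\varphi\rangle = \lambda_u |\varphi\rangle. \tag{21} \]

The above equation can be simplified by introducing the variable

\[ m_f = \begin{cases} \frac{N_f+1}{2} & \text{if } N_f \text{ is odd}, \\ \frac{N_f}{2} & \text{if } N_f \text{ is even}, \end{cases} \tag{22} \]

and by taking into account that $\mathbf{H}_q$ is Hermitian, so that $\mathbf{h}_j = \mathbf{h}^{\dagger}_{N_f-j}$. The simplified version is given as

\[ \tilde{\mathbf{H}}_u = \mathbf{h}_0 + \sum_{l=1}^{m_f-1} \left( \mathbf{h}_l\, \rho_u^{l} + \mathbf{h}_l^{\dagger}\, \rho_u^{\star l} \right) + \Delta, \tag{23} \]

where

\[ \Delta = \begin{cases} \mathbf{0}_{N_c} & \text{if } N_f \text{ is odd}, \\ (-1)^{u}\, \mathbf{h}_{m_f} & \text{if } N_f \text{ is even}. \end{cases} \tag{24} \]

Since in the flow structure $\Delta = \mathbf{0}_{N_c}$ and only three of the blocks $\mathbf{h}_l$ are non-null, we have

\[ \tilde{\mathbf{H}}_u = \mathbf{h}_0 + \mathbf{h}_1 \rho_u + \mathbf{h}_1^{\dagger} \rho_u^{\star}. \tag{25} \]

Replacing the operators $\mathbf{h}$ by their respective counterparts in Eq. (17), we obtain the following expression for the $u$-th matrix in a network with $N_f$ blocks:

\[ \tilde{\mathbf{H}}_u = \mathbf{H}_{in} + e^{\frac{2\pi i u}{N_f}}\, \mathbf{H}_{out} + e^{-\frac{2\pi i u}{N_f}}\, \mathbf{H}_{out}^{\dagger}. \tag{26} \]

In the following sections we investigate how distinct $\mathbf{H}_{in}$ influence $c_\lambda$.

FIG. 3. Specific heat (shown in colors) in terms of the charge $2\pi q$ (polar coordinate) and temperature (radial coordinate) for $N_f = 3$ (a), 4 (b) and 5 (c), assuming $N_c = 45$. This plot was derived from Eq. (33).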
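The reduction from Eq. (17) to Eq. (26) can be checked numerically. The sketch below is an illustration written for this purpose, with arbitrary random blocks standing in for $\mathbf{H}_{in}$ and $\mathbf{H}_{out}$ and hypothetical variable names; it assembles the block-circulant matrix and verifies that its spectrum is the union of the spectra of the $N_f$ reduced matrices $\tilde{\mathbf{H}}_u$.

```python
import numpy as np

rng = np.random.default_rng(1)
n_f, n_c = 5, 4          # number of blocks (must exceed 2) and block size

def random_complex(n):
    return rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

a = random_complex(n_c)
h_in = (a + a.conj().T) / 2.0     # the diagonal block must be Hermitian
h_out = random_complex(n_c)       # arbitrary coupling to the next block

# Assemble the block-circulant matrix of Eq. (17).
h_full = np.zeros((n_f * n_c, n_f * n_c), dtype=complex)
for i in range(n_f):
    j = (i + 1) % n_f
    rows = slice(i * n_c, (i + 1) * n_c)
    cols = slice(j * n_c, (j + 1) * n_c)
    h_full[rows, rows] = h_in
    h_full[rows, cols] = h_out
    h_full[cols, rows] = h_out.conj().T

# Spectra of the reduced matrices of Eq. (26), one per value of u.
reduced = []
for u in range(n_f):
    rho_u = np.exp(2j * np.pi * u / n_f)
    h_u = h_in + rho_u * h_out + np.conj(rho_u) * h_out.conj().T
    reduced.append(np.linalg.eigvalsh(h_u))
reduced = np.sort(np.concatenate(reduced))

full = np.linalg.eigvalsh(h_full)   # eigenvalues in ascending order
spectra_match = np.allclose(full, reduced)
```

Diagonalizing $N_f$ matrices of size $N_c$ instead of one matrix of size $N_f N_c$ is what makes the analytical treatment of the next subsection tractable.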

1. Uniform Connections

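Uniform connection is characterized by every vertex having the same degree, $[\mathbf{D}]_{ii} = d = 2N_c - 1$. Consequently, the intrablock of the magnetic Laplacian is

\[ \mathbf{H}_{in} = \mathbf{I}_{N_c}\left(1 + \frac{1}{d}\right) - \frac{\mathbf{1}_{N_c}}{d}, \tag{27} \]

and the interblock defining the connections between the modular structures is given as

\[ \mathbf{H}_{out} = -\frac{\exp(2\pi i q)}{2d}\, \mathbf{1}_{N_c}, \tag{28} \]

where $\mathbf{1}_{N_c}$ denotes the $N_c \times N_c$ matrix of ones. Substituting the two previous equations into Eq. (26), $\tilde{\mathbf{H}}_u$ can be obtained as

\[ \tilde{\mathbf{H}}_u = \mathbf{I}_{N_c}\left(1 + \frac{1}{d}\right) - \frac{\mathbf{1}_{N_c}}{d} - \frac{2\cos\left(2\pi\left(\frac{u}{N_f} - q\right)\right)}{2d}\, \mathbf{1}_{N_c}. \tag{29} \]

Observe that $\tilde{\mathbf{H}}_u$ is a circulant matrix. Therefore, letting $v \in \{0, \ldots, N_c - 1\}$ and defining

\[ m_c = \begin{cases} \frac{N_c+1}{2} & \text{if } N_c \text{ is odd}, \\ \frac{N_c}{2} & \text{if } N_c \text{ is even}, \end{cases} \tag{30} \]

the eigenvalues of $\tilde{\mathbf{H}}_u$ can be obtained as

\[ \lambda_{u,v} = h_0 + \sum_{l=1}^{m_c-1} \left( h_l\, \rho_v^{l} + h_l^{\dagger}\, \rho_v^{\star l} \right) + \Delta, \tag{31} \]

where $h_l$ is the $l$-th entry of the first row of $\tilde{\mathbf{H}}_u$, $\rho_v = \exp\left(\frac{2\pi i v}{N_c}\right)$ and

\[ \Delta = \begin{cases} 0 & \text{if } N_c \text{ is odd}, \\ (-1)^{v}\, h_{m_c} & \text{if } N_c \text{ is even}. \end{cases} \tag{32} \]

Replacing $h_l$ by their counterparts in Eq. (31), the following eigenvalue equation can be obtained:

\[ \lambda_{u,v} = 1 - \frac{\cos\left(2\pi\left(\frac{u}{N_f} - q\right)\right)}{d} - \frac{2}{d}\left(1 + \cos\left(2\pi\left(\frac{u}{N_f} - q\right)\right)\right) f(v, N_c, m_c) + \Delta, \tag{33} \]

where $f(v, N_c, m_c) = \sum_{l=1}^{m_c-1} \cos\left(\frac{2\pi v l}{N_c}\right)$, such that

\[ f(v, N_c, m_c) = \begin{cases} m_c - 1 & \text{if } v = 0, \\ \dfrac{\sin\left(\frac{\pi v m_c}{N_c}\right)}{\sin\left(\frac{\pi v}{N_c}\right)} \cos\left(\frac{\pi v}{N_c}(m_c - 1)\right) - 1 & \text{otherwise}. \end{cases} \tag{34} \]

Eq. (33) indicates a rotational symmetry related to the charge parameter in the modular directed network. These symmetries are also reflected in the petal structure of the specific heat shown in Fig. 3.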
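As a numerical sanity check of the uniform-connection case, the sketch below builds the corresponding network, diagonalizes its normalized magnetic Laplacian and compares the result with a closed form. It rests on assumptions stated here: the helper names are hypothetical, the network has complete blocks with every node linked to all nodes of the next block, and the closed form uses the standard fact that a matrix $a\mathbf{I} - b\mathbf{1}_{N_c}$, the form taken by Eq. (29), has eigenvalue $a - bN_c$ once and $a$ with multiplicity $N_c - 1$, rather than the series of Eq. (33).

```python
import numpy as np

def normalized_magnetic_laplacian(w, q):
    """H_q of Eq. (3), with degrees taken as row sums of W^(s)."""
    w_s = (w + w.T) / 2.0
    gamma = np.exp(2j * np.pi * q * (w - w.T))
    deg = w_s.sum(axis=1)
    lap = np.diag(deg) - gamma * w_s
    scale = 1.0 / np.sqrt(deg)
    return scale[:, None] * lap * scale[None, :]

def uniform_modular_network(n_f, n_c):
    """Complete blocks; every node points to all nodes of the next block."""
    block = np.arange(n_f * n_c) // n_c
    same = block[:, None] == block[None, :]
    nxt = (block[:, None] + 1) % n_f == block[None, :]
    w = (same | nxt).astype(float)
    np.fill_diagonal(w, 0.0)
    return w

def analytic_spectrum(n_f, n_c, q):
    """Eigenvalues of a*I - b*ones for each u, following Eq. (29)."""
    d = 2 * n_c - 1
    a = 1.0 + 1.0 / d
    values = []
    for u in range(n_f):
        b = (1.0 + np.cos(2.0 * np.pi * (u / n_f - q))) / d
        values += [a - b * n_c] + [a] * (n_c - 1)
    return np.sort(values)

n_f, n_c, q = 3, 5, 0.1     # must satisfy n_f > 2
w = uniform_modular_network(n_f, n_c)
numeric = np.linalg.eigvalsh(normalized_magnetic_laplacian(w, q))
analytic = analytic_spectrum(n_f, n_c, q)
# A shift of the charge by 1/N_f only relabels u, so the spectrum repeats.
rotated = np.linalg.eigvalsh(normalized_magnetic_laplacian(w, q + 1.0 / n_f))
```

The invariance of the spectrum under $q \to q + 1/N_f$ is the $N_f$-fold rotational symmetry that appears as petals in the polar plots of the specific heat.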

2. Asymmetries in the specific heat petal structures

The results obtained in the previous section help to understand the relationship between the modular structures and the magnetic Laplacian spectrum, as well as the specific heat symmetry. However, these results assume that the inner structures $\mathbf{H}_{in}$ are undirected. The effect of directionality can be inferred by generating random directions inside the intrablocks, i.e., by imposing that $[\mathbf{W}_{in}]_{u,v}$ is nonzero with probability $p_c < 1$. For $p_c = 30\%$, we calculate the specific heat by numerical diagonalization, yielding the structures in Fig. 4. We can observe that the obtained petals are not symmetric, unlike what had been observed for uniform connections.

B. Model characterization of directed graphs

FIG. 4. Specific heat (colors) in terms of the charge $2\pi q$ (angle) and temperature (radius), for $N_f = 3$ (a), 4 (b) and 5 (c), assuming $N_c = 45$. The networks were generated randomly, imposing the probability of having a directed edge as $p_c = 30\%$. Observe the asymmetric petals, in contrast with the results obtained previously for the uniform connections.

In this section we address the task of characterizing distinct network models through the spectra of the magnetic Laplacian. In particular, given a set of measurements obtained from a graph, can we infer which model created that graph? In this work, we opted to use the specific heat, $c_\lambda$, as the graph feature with which to address this question. As shown in Fig. 1, the $c_\lambda$ measurements yield specific behavior for different models, therefore providing valuable information that can be used to identify and discriminate between different complex network models.

In order to evaluate the efficiency of using $c_\lambda$ as a fingerprint of a directed network, we built a dataset with 2000 network samples of the types Erdős–Rényi (ER), Barabási (BA), the scale-free model of Bollobás et al. (SF), Watts-Strogatz (WS), and SBM with 3 and 4 blocks. Then, self-organizing maps (SOMs), a method for unsupervised clustering, were trained with the obtained $c_\lambda$ values and the obtained regions were subsequently labeled. This was done by feeding each training sample into the SOM and choosing the neuron that exhibited the highest activation. As indicated by the results shown in Fig. 5, networks belonging to the same class were mapped onto nearby neurons, defining respective clusters. So, the SOM was able, without previous knowledge, to find the patterns of $c_\lambda$ associated with the considered types of networks.

From what we have seen, we can conclude that the suggested magnetic Laplacian approach is able, at least for the considered cases, to properly characterize the model of given networks. For this reason, in a similar manner to what has been applied in condensed matter physics, the SOM proved to be a powerful technique for characterizing complex networks when these networks are seen through the lens of statistical mechanics and magnetic Laplacians.

FIG. 5. SOM mapping of six types of complex networks (BA, ER, SF, WS, flux3 and flux4) represented by the specific heat approach. The neuron index x and neuron index y correspond to neurons in the SOM cortical space. The distances between neighboring neurons (U-matrix) are indicated in gray. A good separation between the types of networks can be observed.

Given that many real-world networks contain a large number of nodes, a question arises regarding the feasibility of using spectral quantities for their characterization. As described in the methodology section, thanks to the magnetic Laplacian formalism, the KPM can be used as a means to estimate spectral density measurements. For instance, given a modular directed network, we obtained the exact and KPM-approximated values of the specific heat for different temperatures and charge values. The approximated specific heat is shown in Fig. 6. The error bars indicate a small dispersion, corroborating the potential of the KPM approach for studying the spectral properties of complex networks.

FIG. 6. Approximated specific heat, $c_\lambda$, as a function of the temperature, $T$, for a network with $|V| = 3000$, $N_f = 3$, $p_c = 0.25$ and $p_d = 0.5$, with one curve per charge value $q$. In the application of the KPM, the expansion was truncated at the first 40 terms and the stochastic trace approximation used 25 random vectors. The error bars represent the deviation between the exact value (obtained numerically) and the approximated value calculated by the KPM and numerical integration.

C. Directed network parameter Inference

The results shown in Fig. 5 indicate that, given a network $\tilde{G}$, we can infer which model was responsible for generating it. To complete the task of characterizing a network, it is also necessary to find the network which most closely resembles $\tilde{G}$ among several networks created with distinct parameters while fixing the model. In this section, we explore the problem of inferring the parameters of models using the spectra of the magnetic Laplacian.

In order to argue that the Wasserstein metric can be combined with the KPM approach as a means to estimate the network model parameters with sufficient precision, we study the problem of inferring the connecting probabilities $\tilde{p}$ of ER networks and the out-degree $\tilde{m}$ of BA networks, both with approximately 10 nodes.

In Fig. 7 the continuous vertical lines show the correct value of the parameter and the vertical dashed lines identify the position of the minimum of Eq. (15), which is the inferred value of the parameter. By using the KPM with the first 100 terms of the Chebyshev expansion and approximating the trace by using 20 random vectors, we observe that the parameters can be inferred with good accuracy.

FIG. 7. The curves in (a) and (b) represent the mean of the 1-Wasserstein distances, Eq. (15), for BA and ER networks, respectively, in terms of the parameters adopted for network generation, considering $N_{exp} = 5$, $|V| = 10$ , and $Q = \{ , / \}$. The spectral estimation with the KPM used 100 expansion terms and 20 random vectors. Legend of panel (a): $\tilde{m} = 2$, $m_{min} = 2$; $\tilde{m} = 5$, $m_{min} = 5$.

IV. CONCLUSIONS

Directed networks can be used to represent several real-world structures and problems. As a consequence, several approaches have been proposed for characterizing and comparing directed networks. Among these approaches, spectral methods present some particularly interesting properties, such as bearing a direct relationship with the structural and dynamical aspects of given networks. However, when applied to directed networks, the usual Laplacian operator yields complex eigenvalues, which are difficult to treat and interpret. Nevertheless, the Hermiticity of the magnetic Laplacian allows a set of real eigenvalues to be associated with a weighted directed network. We showed here that these real eigenvalues and the associated charge parameter convey information about the network, more specifically regarding its mesoscale structures and the spectral and specific heat symmetry.

In order to extend the proposed methodology to larger networks containing hundreds of thousands of nodes, we showed that the KPM can be combined with the magnetic Laplacian approach. This combination allowed the spectral density of the magnetic operator to be estimated with remarkable efficiency and accuracy. Given that we could estimate the spectral density of the magnetic Laplacian, we showed that the study of spectral geometry under the Wasserstein metric can be used as a tool to infer parameters of networks with low relative errors.

The reported contributions pave the way to a number of future developments and applications involving directed complex networks. For instance, these methods can be applied to study several other theoretical and real-world structures, including fake news dissemination, metabolic networks and neuronal systems, to name but a few possibilities. It would also be interesting to perform studies using random matrix theory in order to infer relationships between topology and spectra for more general complex networks. Since we deal only with spectral information, the results presented in this paper could also be immediately applied to multiplex networks.

ACKNOWLEDGEMENTS

The authors thank Thomas Peron, Henrique F. de Arruda, Paulo E. P. Burke and Filipi N. Silva for all suggestions and useful discussions. Bruno Messias thanks CAPES for financial support. Luciano da F. Costa thanks CNPq (grant no. 307085/2018-0) and NAP-PRP-USP for sponsorship. This work has also been supported by FAPESP grant 15/22308-2. Research carried out using the computational resources of the Center for Mathematical Sciences Applied to Industry (CeMEAI) funded by FAPESP (grant 2013/07375-0).

DATA AVAILABILITY

Data sharing is not applicable to this article as no new data were created or analyzed in this study. However, our implementation of the KPM is available at github.com/stdogpkg/emate. In addition, eMaTe also allows trace functions of symmetric adjacency matrices to be estimated with good accuracy and computational efficiency.

M. Kac, "Can one hear the shape of a drum?" The American Mathematical Monthly, 1–23 (1966).
O. Giraud and K. Thas, "Hearing shapes of drums: Mathematical and physical aspects of isospectrality," Rev. Mod. Phys., 2213–2255 (2010).
D. Aasen, T. Bhamre, and A. Kempf, "Shape from sound: Toward new tools for quantum gravity," Physical Review Letters (2013), 10.1103/physrevlett.110.121301.
L. Cosmo, M. Panine, A. Rampini, M. Ovsjanikov, M. M. Bronstein, and E. Rodolà, "Isospectralization, or how to hear shape, style, and correspondence," (2018).
D. M. Cvetković, "Graphs and their spectra," Publikacije Elektrotehničkog fakulteta. Serija Matematika i fizika, 1–50 (1971).
E. R. van Dam and W. H. Haemers, "Which graphs are determined by their spectrum?" Linear Algebra and its Applications, 241–272 (2003).
C. Sarkar and S. Jalan, "Spectral properties of complex networks," Chaos: An Interdisciplinary Journal of Nonlinear Science, 102101 (2018).
J. Wang, R. C. Wilson, and E. R. Hancock, "Detecting Alzheimer's disease using directed graphs," in Graph-Based Representations in Pattern Recognition (Springer International Publishing, 2017) pp. 94–104.
K. Anand and G. Bianconi, "Entropy measures for networks: Toward an information theory of complex topologies," Phys. Rev. E, 045102 (2009).
M. Dehmer and A. Mowshowitz, "A history of graph entropy measures," Information Sciences, 57–78 (2011).
C. Ye, C. H. Comin, T. K. D. Peron, F. N. Silva, F. A. Rodrigues, L. d. F. Costa, A. Torsello, and E. R. Hancock, "Thermodynamic characterization of networks using graph polynomials," Phys. Rev. E, 032810 (2015).
M. De Domenico and J. Biamonte, "Spectral entropies as information-theoretic tools for complex network comparison," Phys. Rev. X, 041062 (2016).
C. Nicolini, V. Vlasov, and A. Bifone, "Thermodynamics of network model fitting with spectral entropies," Phys. Rev. E, 022322 (2018).
J. D. Hart, J. P. Pade, T. Pereira, T. E. Murphy, and R. Roy, "Adding connections can hinder network synchronization of time-delayed oscillators," Phys. Rev. E, 022804 (2015).
G. Berkolaiko, "Nodal count of graph eigenfunctions via magnetic perturbation," Analysis & PDE, 1213–1233 (2013).
M. Fanuel, C. M. Alaíz, and J. A. K. Suykens, "Magnetic eigenmaps for community detection in directed networks," Phys. Rev. E, 022302 (2017).
S. Furutani, T. Shibahara, M. Akiyama, K. Hato, and M. Aida, "Graph signal processing for directed graphs based on the Hermitian Laplacian," in Machine Learning and Knowledge Discovery in Databases, edited by U. Brefeld, E. Fromont, A. Hotho, A. Knobbe, M. Maathuis, and C. Robardet (Springer International Publishing, Cham, 2020) pp. 447–463.
L. Kantorovitch, "On the translocation of masses," C. R. (Dokl.) Acad. Sci. URSS, n. Ser., 199–201 (1942).
V. I. Bogachev and A. V. Kolesnikov, "The Monge-Kantorovich problem: achievements, connections, and perspectives," Russian Mathematical Surveys, 785–890 (2012).
G. Peyré and M. Cuturi, "Computational optimal transport," Foundations and Trends® in Machine Learning, 355–607 (2019).
B. Bollobás, C. Borgs, J. Chayes, and O. Riordan, "Directed scale-free graphs," in

Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms (Society for Industrialand Applied Mathematics, 2003) pp. 132–139. Y. C. de Verdi`ere, “Magnetic interpretation of the nodal defecton graphs,” Analysis & PDE , 1235–1242 (2013). E. H. Lieb and M. Loss, “Fluxes, laplacians, and kasteleyn’s theo-rem,” in

Statistical Mechanics (Springer Berlin Heidelberg, 1993) pp. 457–483. K. Blum,

Density matrix theory and applications , Vol. 64(Springer Science & Business Media, 2012). A. Weiße, G. Wellein, A. Alvermann, and H. Fehske, “The kernelpolynomial method,” Rev. Mod. Phys. , 275–306 (2006). J. P. Boyd,

Chebyshev and Fourier Spectral Methods: SecondRevised Edition (Dover Books on Mathematics) (Dover Publica-tions, 2001). M. Hutchinson, “A stochastic estimator of the trace of the influ-ence matrix for laplacian smoothing splines,” Communicationsin Statistics - Simulation and Computation , 433–450 (1990). D. Jackson, “On approximation by trigonometric sums and poly-nomials,” Transactions of the American Mathematical Society , 491–491 (1912). R. M. Gray, “Toeplitz and circulant matrices: A review,” Founda-tions and Trends R ○ in Communications and Information Theory , 155–239 (2005). B. Bollob´as, C. Borgs, J. Chayes, and O. Riordan, “Directedscale-free graphs,” in

Proceedings of the Fourteenth AnnualACM-SIAM Symposium on Discrete Algorithms , SODA ’03 (So-ciety for Industrial and Applied Mathematics, Philadelphia, PA,USA, 2003) pp. 132–139. A. A. Shirinyan, V. K. Kozin, J. Hellsvik, M. Pereiro, O. Eriks-son, and D. Yudin, “Self-organizing maps as a method for de-tecting phase transitions and phase identification,” Phys. Rev. B99