[PDF] Exploring phases of the Su-Schrieffer-Heeger model with tSNE

Abstract

T-distributed stochastic neighborhood embedding (tSNE) is used as a tool to reveal the phase diagram of the Su-Schrieffer-Heeger model and some of its extended and non-Hermitian variants. Bloch vectors calculated at different points in the parameter space are mapped to a two-dimensional reduced space. The clusters in the reduced space are used to visualize different phase regions included in the input. The tSNE mapping is shown to be effective even in the challenging case of the non-Hermitian extended model where five different phases are present. An example of using wavefunction input, instead of Bloch vectors, is presented also.

Full PDF

EExploring phases of the Su-Schrieﬀer-Heeger model with tSNE

R. M. Woloshyn

TRIUMF, 4004 Wesbrook Mall, Vancouver, British Columbia, Canada V6T 2A3

T-distributed stochastic neighborhood embedding (tSNE) is used as a tool to reveal the phasediagram of the Su-Schrieﬀer-Heeger model and some of its extended and non-Hermitian variants.Bloch vectors calculated at diﬀerent points in the parameter space are mapped to a two-dimensionalreduced space. The clusters in the reduced space are used to visualize diﬀerent phase regions includedin the input. The tSNE mapping is shown to be eﬀective even in the challenging case of the non-Hermitian extended model where ﬁve diﬀerent phases are present. An example of using wavefunctioninput, instead of Bloch vectors, is presented also.

I. INTRODUCTION

Machine learning is being increasingly utilized in physics [1, 2]. Examples of applications include event classiﬁcation[3–5] and anomaly detection [6, 7] in the analysis of particle physics experiments, aiding observations in astronomy[8], and the study of phases in condensed matter systems [9–12]. The focus here on the last topic, phases of matter.Many diﬀerent machine learning methods have been applied to the exploration of phases and phase transitions.These include neural networks [13–16], principal component analysis [17, 18], support vector machines [19] anddiﬀusion maps [20–22]. In an interesting recent work, Yang et al. [23] suggested the use of t-distributed stochasticneighborhood embedding (tSNE) as an unsupervised learning method to obtain a visualization of phase diagrams andused this method to study a number of one-dimensional quantum spin systems.The idea of using unsupervised learning methods to reveal phases is particularly appealing. Speciﬁcally, topologicalphases [24] which can not be characterized by local order parameters can be studied without a priori input of domainknowledge. Ref. [21, 22, 25–30] are examples of recent work on identifying topological phases and phase transitionsusing either neural networks or diﬀusion maps. The Ref. [22, 25–29] focus on the Su-Schrieﬀer-Heeger (SSH) model[31, 32] which is also the subject of this work.The SSH model was introduced as a model for polyacetylene and is a simple extensively studied example of amodel for a topological insulator [33, 34]. The basic model consists of electrons (taken to be spinless) hopping on aone-dimensional lattice. The nearest-neighbor hopping amplitudes are taken to be staggered (see, for example, Fig.1.1 in [33]) so the lattice can be divided into two-site units cells. Some details of the model are given in Sec. III.The SSH model can be extended by introducing longer range interactions. The extended SSH model consideredhere allows for the addition of next-next-nearest neighbor hopping terms [28, 35]. A diﬀerent type of modiﬁcation ofthe model which has received considerable recent attention is to allow nonreciprocal intra-cell hoping [36, 37]. Thisleads to a non-Hermitian Hamiltonian and the appearance of topological phases with fractional winding number [37].In this paper the tSNE algorithm, which has been used to study one-dimensional spin systems [23], is applied tothe SSH model and the variants mentioned above. The algorithm is based on dimensionality reduction. The model,sampled at a variety of points in its parameter space, is described by points distributed in a high-dimensional spaceand tSNE maps this space to a lower-dimensional space where, ideally, there is a clustering of data which can be usedto identify regions of parameter space which share common features. In this work a model which allows calculations tobe made for diﬀerent parameters but the same strategy can be applied to data from experiments where measurementsare made under diﬀerent experimental conditions.The implementation of tSNE analysis requires choices to made, for example, for a distance function for points inthe high-dimensional input space and for the algorithm to identify clusters in the tSNE output. These issues arediscussed in Sec. IV and in the Appendices. With appropriate choices it is found that the unsupervised tSNE analysiscan give a correct visualization of the phase diagram even the most challenging case of the non-Hermitian extendedSSH model where ﬁve phases are present.The tSNE algorithm is introduced in Sec. II and Sec. III gives a brief outline of the SSH model and some of itsextensions. Results are presented in Sec. IV with a summary in Sec. V.

II. T-DISTRIBUTED STOCHASTIC NEIGHBORHOOD EMBEDDING

T-distributed stochastic neighborhood embedding [38] (tSNE) is a based on the notion of dimensionality reduction.The system to be analyzed is described by a set of points { x } in a space with large dimension which may obscure thepresence features with a much smaller dimensionality. The idea of tSNE is to construct a mapping of { x } to points a r X i v : . [ c ond - m a t . m e s - h a ll ] J a n { y } in a space of low dimension (typically 2 or 3) in such a way that points in the high-dimensional space that sharesome common feature will lie close together in the low-dimensional space.A key element of tSNE, common to other some machine learning algorithms, for example, diﬀusion maps, is thedistance D ij ( x i , x j ) between points in the high-dimensional space. The distance D is commonly taken to be theEuclidean distance between the points but other metrics can be used and, depending on the application, may be moreeﬀective. The ﬁrst step of the algorithm is to assign a conditional probability that point i should have point j as aneighbor P j | i = e − D ij / σ i (cid:80) k (cid:54) = i e − D ik / σ i , (1)for all i (cid:54) = j. The σ i ’ s are hyperparameters of the algorithm. Then a probability P ij is deﬁned by P ij = P i | j + P j | i N , (2)where N is the number of points in the set { x } .A probability distribution Q ij for points { y } in the low-dimensional space is also deﬁned. It is taken to be a Studentt-distribution Q ij = (cid:104) (cid:107) y i − y j (cid:107) (cid:105) − (cid:80) k (cid:54) = i (cid:104) (cid:107) y i − y k (cid:107) (cid:105) , (3)using Euclidean distance. The algorithm then tries to make the probability distribution Q similar to P by ﬁnd points { y } which minimize the Kullback-Leibler divergence deﬁned by (cid:88) ij P ij log P ij Q ij . (4)The tSNE algorithm is included in the machine learning toolkit scikit-learn [39, 40] and that is the implementationthat will be used in this work. In the implementation of tSNE the hyperparameters are ﬁxed implicitly by requiringthat the so-called perplexity P , deﬁned by log P = − (cid:88) j P j | i log P j | i , (5)takes a speciﬁed value. III. THE SU-SCHRIEFFER-HEEGER MODEL

The SSH model was introduced as a model for polyacetylene [31, 32]. It consists of electrons hopping on one-dimensional lattice with a two-site unit cell. The intra-cell and inter-cell couplings are taken to be diﬀerent. TheHamiltonian for the basic model with L unit cells is H = L (cid:88) j =1 (cid:104) t a † j b j + t a † j +1 b j (cid:105) + h.c. (6)where a † i and b i denote creation and annihilation operators on the two diﬀerent sites of the i’th unit cell. With periodicboundary conditions the Hamiltonian can be written in momentum space as H ( k ) = d x ( k ) σ x + d y ( k ) σ y , (7)where the σ ’s are Pauli matrices and the Bloch vectors are d x ( k ) = t + t cos( k ) ,d y ( k ) = t sin( k ) . (8)The momenta k take values in [ , π ] . The SSH model has a nontrivial topological property. If t > t the vector d = ( d x , d y ) will rotate about the origin of the d x − d y plane as k varies from to π (see, for example, Fig. 8 in Ref[34]). Note that the trajectory of the Bloch vectors passing through the point d = 0 is the condition that the modelis gapless Ref [37]. This can occur when t = ± t which is the boundary in parameter space between diﬀerent phases.When t > t the winding number w = 1 where for our discretized lattice w = 12 π L (cid:88) j ∆φ ( j ) , (9)where ∆φ ( j ) = | φ ( j ) − φ ( j − | mod π with φ j equal to the phase of d x ( k ) + id y ( k ) for k = 2 πj/L . When t < t , thewinding number vanishes.The SSH model can be extended to include next-to-next-nearest neighbor inter-cell hoping and the coupling betweennearest neighbor cells and next-to-next-nearest neighbor cells can be taken to be diﬀerent. In this case the Blochvectors take the form d x ( k ) = t + t cos( k ) + t cos(2 k ) ,d y ( k ) = t sin( k ) + t sin(2 k ) . (10)In this case the vector d will trace out a double loop in the d x − d y plane and the winding number can take values 0,1 or 2 (See, for example, [28, 35]).Another extension of the SSH model is to allow for a non-reciprocal inter-cell hopping strength. This leads to anon-Hermitian Hamiltonian. In the non-Hermitian extension considered here the Bloch vectors take the form [29, 37] d x ( k ) = t + t cos( k ) ,d y ( k ) = t sin( k ) − iδ, (11)where δ is a real parameter characterizing the diﬀerence between left and right intra-cell hopping. The condition forthe energy gap to vanish becomes t = ± t ± δ . In addition to phases with winding number 0 and 1, there are regionsof parameter space where w = 1 / .The extended SSH model (10) can also be modiﬁed to include non-Hermiticity. The Bloch vectors become [37] d x ( k ) = t + t cos( k ) + t cos(2 k ) ,d y ( k ) = t sin( k ) + t sin(2 k ) − iδ. (12)The phase diagram becomes quite complicated with regions of winding number 1/2 and 3/2 appearing along withwinding numbers 0, 1 and 2 that are present when δ = 0. IV. RESULTSA. Bloch vector input

In this section results of applying the tSNE reduction to the SSH models of Section are described. The Bloch vectorscalculated at diﬀerent points lying in a two-dimensional plane of parameter space and spanning diﬀerent phases areused as input into the algorithm. The reduced space is taken to be two-dimensional. As mentioned in Sec. II thedistance function in the space of input vectors need not be the Euclidean distance. Scikit-learn provides a varietyof metrics. In this work the L p norms with p = 1 , , ∞ were considered. In scikit-learn these are called Cityblock,Euclidean and Chebyshev respectively. Recall that the L p norm of a vector v is L p = (cid:32)(cid:88) i | v i | p (cid:33) p . (13)If the number of clusters to which the data are mapped is very small the choice of metric may not be critical. However,if, for example, the input data span a large number phases some metrics may be more eﬀective than others. Theexample in Appendix A Fig. 6 of the non-Hermitian extended SSH model shows that using the Chebyshev distanceleads to well separated clusters while other choices do not. All results presented in this Section based on Bloch vectorinput were calculated using the Chebyshev distance. Note the use of this metric was earlier advocated by Che et al. [25] in their study topological phases using diﬀusion maps. . . . . . . . . . t . . . . . . . . . t − −

10 0 10 20 y − − − − y tsne[Chebyshev] − −

10 0 10 20 y − − − − y k-means . . . . . . . . . t . . . . . . . . . t Figure 1. Phases of the SSH model exposed by tSNE.Top-left:Points in the t − t parameter space at which input Bloch vectorsare calculated. Top-right:Output of tSNE. Bottom-left:Clusters identiﬁed by k-means. Bottom-right:Points in parameter spacewith color and symbol showing the cluster to which they correspond. The black line shows the known phase boundary. . . . . . . . . . t − − − − t − − −

10 0 10 20 30 40 y − − − y tsne[Chebyshev] − − −

10 0 10 20 30 40 y − − − y dbscan . . . . . . . . . t − − − − t Figure 2. Phases of the extended SSH model exposed by tSNE.Top-left:Points in the t − t parameter space with t = 1 at whichinput Bloch vectors are calculated. Top-right:Output of tSNE. Bottom-left:Clusters identiﬁed by dbscan. Bottom-right:Pointsin parameter space with color and symbol showing the cluster to which they correspond. The black lines show the known phaseboundaries. .

00 0 .

25 0 .

50 0 .

75 1 .

00 1 .

25 1 .

50 1 .

75 2 . δ . . . . . . . . . t − −

20 0 20 y − − − − y tsne[Chebyshev] − −

20 0 20 y − − − − y dbscan .

00 0 .

25 0 .

50 0 .

75 1 .

00 1 .

25 1 .

50 1 .

75 2 . δ . . . . . . . . . t Figure 3. Phases of the non-Hermitian SSH model exposed by tSNE.Top-left:Points in the δ − t parameter space with t =1 at which input Bloch vectors are calculated. Top-right:Output of tSNE. Bottom-left:Clusters identiﬁed by bdscan. Bottom-right:Points in parameter space with color and symbol showing the cluster to which they correspond. The black lines show theknown phase boundaries. First consider the basic SSH model Eq. 8. The model has two parameters. A sample of 400 Bloch vectorscorresponding to a lattice of 80 cells was constructed using parameters chosen randomly in the range ≤ t , t ≤ . The selected points are shown in the top-left panel of Fig. 1 The result of tSNE mapping of the Bloch vectors toa two-dimensional space using the Chebyshev distance is shown in the top-right panel the ﬁgure. The default valueof the perplexity, P = 30, was used. Two clusters can be identiﬁed by inspection or by using a clustering algorithmsuch as k-means [41]. The clusters, coded by color and symbol, are displayed in the bottom left panel. The points inparameter space are replotted in the bottom-right panel now coded with color and symbol indicating the cluster towhich they were grouped by tSNE. The diagonal line is the phase boundary between the topological phase t > t with winding number equal to 1 and the band insulator phase t < t with winding number 0. One sees that tSNEmakes a clear separation of the phases in a unsupervised way giving a visualization of the phase diagram.The extended SSH model, Eq. 10, has three parameters. We consider the model at t = 1 and allow the otherparameters to vary in the range ≤ t ≤ and − ≤ t ≤ . The randomly selected points are shown in the top-leftpanel of Fig. 2 and the tSNE output is in the top-right panel. This model has a more complicated phase diagramthan the basic SSH model (see Fig. 5 in [28]) and the choice of clustering algorithm becomes an issue. The resultsof using k-means clustering are compared to dbscan [41, 42] in Appendix B Fig. 8. The dbscan clusters, coded withdiﬀerent colors and symbols, are shown in the bottom-left of panel Fig. 2. The corresponding points in the t − t plane along with the known phase boundaries [28] are plotted in the bottom-right panel. The tSNE analysis gives aclean separation of the phase diagram into four regions in agreement with known results. Note that this would notbe the case if k-means were used for cluster identiﬁcation.The phase diagram of the non-Hermitian SSH model , Eq. 11, is shown in Fig. 3 of Ref. [37] for t = 1. For this workwe consider a quadrant of the parameter space with ≤ δ, t ≤ . The input Bloch vectors were calculated at the 400points plotted in the top-left panel of Fig. 3 and top-right panel shows the tSNE output. Cluster identiﬁcation usingdbscan is shown in the bottom-left panel. The four distinct regions in the parameter space are correctly identiﬁed(bottom-right panel) as indicated by the black lines showing the phase boundaries.The non-Hermitian extended SSH model, Eq. 11, presents more of a challenge. There are ﬁve phases with diﬀerent . . . . . . . δ . . . . . . . t − −

20 0 20 40 y − − y tsne[Chebyshev] − −

20 0 20 40 y − − y dbscan . . . . . . . δ . . . . . . . t Figure 4. Phases of the non-Hermitian extended SSH model exposed by tSNE.Top-left:Points in the δ − t parameter space with t , t = 1 at which input Bloch vectors are calculated. Top-right:Output of tSNE. Bottom-left:Clusters identiﬁed by dbscan.Bottom-right:Points in parameter space with color and symbol showing the cluster to which they correspond. The black linesshow the known phase boundaries. winding numbers. The phase diagram in the δ − t plane for t , t = 1 is given in Fig. 5 of Ref. [37]. Here 529 pointsin the range ≤ δ, t ≤ are used for the input Bloch vectors. These are shown in the top-left panel of Fig. 4 andthe resulting tSNE mapping is in the top-right panel. A perplexity of 20 was used in this case as it was found togive a better separation of the clusters than the default value of 30. The dbscan identiﬁcation of clusters, depicted indiﬀerent colors and symbols, is in the lower-left panel. The bottom-right panels shows the points in the δ − t planeassociated with diﬀerent clusters. Phase boundaries are shown by the black lines. It seems quite remarkable that thecombination of tSNE dimensionality reduction and dbscan clustering can distinguish all six regions even though someare small and represented by only a few points in the input sample. B. Wavefunction input

An alternative to using the Bloch vectors to explore the phases is to use the eigenvectors of the real-space Hamilto-nian. For a lattice of L cells the Hamiltonian can be expressed as an L × L sparse matrix with nonzero entries alongthe super- and sub-diagonal [33]. Let | ψ i > and | ψ j > denote the lowest positive energy eigenvectors for two diﬀerentchoices of the model parameters. An L p norm of the diﬀerence between these vectors can be used as a distance in antSNE analysis but this may not be the best choice. Yang et al. [23] suggest that for visualizing quantum phases withtSNE a better measure of distance between quantum states would be − log | < ψ i | ψ j > | (14)which they call the negative logarithmic ﬁdelity (NLF). As an example, Fig. 7 in Appendix A shows the tSNEreduction using a sample of 400 SSH model wavefunctions calculated on a lattice of 16 cells for diﬀerent randomlychosen t and t values. Since the model has two phases two clusters of points in the y − y plane are expected. Withan L p norm one sees two clusters of outliers in addition to the main clusters which makes the association of clusterswith phases problematic. With NLF there is a clear separation into reasonably compact clusters. The complete . . . . . . . . . t . . . . . . . . . t − − y − . − . − . . . . . . . y tsne[NLF] − − y − . − . − . . . . . . . y k-means . . . . . . . . . t . . . . . . . . . t Figure 5. Phases of the SSH model exposed by tSNE with wavefunction input. Top-left:Points in the t − t parameter spaceat which input wavefunctions are calculated. Top-right:Output of tSNE. Bottom-left:Clusters identiﬁed by k-means. Bottom-right:Points in parameter space with color and symbol showing the cluster to which they correspond. The black line shows theknown phase boundary. analysis with wavefunction input using NLF is shown in Fig. 5. The two phases are clearly distinguished just as inFig. 1 where Bloch vectors were used. V. SUMMARY AND DISCUSSION

Machine learning oﬀers many ways of exploring phase transitions in condensed matter systems. Particularly inter-esting are unsupervised methods which require no a priori knowledge such a as choice of an order parameter. Suchmethods are especially suited to the study of topological phase transitions where there is no local order parameter.In this work it is shown that t-distributed Stochastic Neighborhood Embedding it is can be used to learn the phaseboundaries of the Su-Schrieﬀer-Heeger model and some of its extensions. Input into the analysis consists of Blochvectors constructed at diﬀerent points of the model parameter space. The tSNE algorithm was used to map the Blochvectors to clusters in a reduced space (two-dimensional in this work) corresponding to diﬀerent regions of the phasediagram. This allows a visualization of the phase diagram as shown in Figs. 1 to 4. Wavefunction input can also beused in the analysis as shown in Fig. 5.tSNE does not work automatically. The choice of distance function used to construct the probability distributionof the input can aﬀect the results. For Bloch vector input, the Chebyshev distance was preferred since it worked welleven in the diﬃcult case of the non-Hermitian extended SSH model. With wavefunction input, negative logarithmicﬁdelity [23] was found to useful whereas L p norms did not produce usable even in the simple case with only twophases. As well, the perplexity may require some adjustment to get a good separation of the clusters in the reducedspace. Since tSNE is stochastic, the output varies from run to run so making adjustments to the algorithm requiressome care.Although tSNE can be used to identify regions in diﬀerent phases, it can not ﬁnd the properties, such as the windingnumber, associated to diﬀerent regions. This true also for other machine learning methods, for example, principalcomponent analysis or diﬀusion maps, used to expose phase boundaries. Nonetheless, since these learning algorithmsare unsupervised, they can provide useful information about phase transitions without input of domain knowledge. ACKNOWLEDGMENTS

TRIUMF receives federal funding via a contribution agreement with the National Research Council of Canada.

Appendix A: Choice of metric . . . . . . . δ . . . . . . . t − − −

10 0 10 20 30 y − − y tsne[Cityblock] − −

20 0 20 40 60 y − − − y tsne[Euclidean] − −

20 0 20 40 y − − y tsne[Chebyshev] Figure 6. tSNE mapping of the non-Hermitian extended SSH model using diﬀerent distance functions.Top-left:Points in the δ − t parameter space with t , t = 1 at which input Bloch vectors are calculated. The other panels show tSNE output usingthe distance function indicated in the panel title in Eq. 1. The choice of distance function used in constructing the probability distribution Eq. 1 in the input space can aﬀectthe mapping in the reduced space. As an example the case of the non-Hermitian extended SSH model is presentedhere. Bloch vectors at 529 points selected in the range ≤ δ, t ≤ with t , t = 1 were used as input. These areplotted in the top-left panel of Fig. 6. The black lines show phase boundaries. The other panels show the tSNEmapping to two-dimensional space where the distance function used is indicated in the panel title. Cityblock is thescikit-learn name for the L norm. Only the Chebyshev distance leads to a usable clustering corresponding to thediﬀerent regions of the phase diagram. Cityblock clusters the points correctly but does provide any separation betweenpoints mapped from the two small regions indicated by diamonds and +’s. In this example the input data are labeledto illustrate the eﬀect of diﬀerent distance functions. In an analysis where the input is not labeled separation ofclusters from diﬀerent regions is critical in order to identify phases correctly.Scikit-learn provides a variety of distance functions but sometimes a custom distance function may have to becrafted in order to get good results. Fig. 7 shows the tSNE mapping of the SSH model for wavefunction input usingdiﬀerent distance functions as indicated in the panel titles. This model has two phases so in the reduced space onewould expect to see two clusters of points if tSNE is identifying the phases correctly. This is not case using theCityblock, Euclidean and Chebyshev distances. A binary classiﬁcation of the points is problematic. However, using − − y − . − . − . . . . . . . y tsne[NLF] − −

10 0 10 20 y − − y tsne[Cityblock] − − −

10 0 10 20 30 40 y − − y tsne[Euclidean] − − −

10 0 10 20 30 y − − − − y tsne[Chebyshev] Figure 7. tSNE mapping of the SSH model with wavefunction input using the distance function indicated in the panel title inEq. 1. − − −

10 0 10 20 30 40 y − − − y dbscan − − −

10 0 10 20 30 40 y − − − y k-means Figure 8. Comparison of dbscan and k-means clsutering of tSNE output for the extended SSH model shown in the top-rightpanel of Fig. 2.

Appendix B: k-means versus dbscan

After mapping of the input data to a reduced space one would like to identify clusters of points sharing somecommon features. This could be done by inspection but better would be to have an algorithmic procedure which moreobjective. However, this can lead to a problem if the choice of clustering algorithmic is not appropriate. An exampleis shown in Fig. 8.The two panels panels show the clusters, indicated by diﬀerent symbols and colors, returned by the dbscan andk-means clustering algorithms [41] for the tSNE output of the extended SSH model (top-right panel of Fig. 2). ThetSNE output has well separated clusters but they are not particularly compact. The k-means clustering algorithm,which calculates distances from a set of centroids, fails in this case where there is a cluster that is quite extended. Onthe other hand, dbscan, which analyzes the data by dividing it into subgroups based on the density of points, givesthe correct identiﬁcation of clusters associated with diﬀerent regions of the phase diagram as shown in Fig. 2. [1] P. Mehta et al. , Physics Reports , 1 (2019).[2] G. Carleo et al. , Reviews of Modern Physics , 045002 (2019).[3] P. T. Komiske, E. M. Metodiev and M. D. Schwartz, Journal of High Energy Physics , 110 (2017).[4] E. M. Metodiev, B. Nachman and J. Thaler, Journal of High Energy Physics , 174 (2017).[5] A. Butter, G. Kasieczka, T. Plehn and M. Russell, SciPost Physics , 028 (2018).[6] M. Farina, Y. Nakai and D. Shih, Physical Review D , 075021 (2020).[7] J. Hajer, Y.-Y. Li, T. Liu and H. Wang, Physical Review D , 076015 (2020).[8] C. J. Fluke and C. Jacobs, WIREs Data Mining and Knowledge Discovery , e1349 (2019).[9] S. J. Wetzel, Physical Review E , 022140 (2017).[10] J. Carrasquilla and R. G. Melko, Nature Physics , 431 (2017).[11] E. P. L. van Nieuwenburg, Y.-H. Liu and S. D. Huber, Nature Physics , 435 (2017).[12] M. S. Scheurer and R.-J. Slager, Physical Review Letters , 226401 (2020).[13] S. J. Wetzel and M. Scherzer, Physical Review B , 184410 (2017).[14] P. Suchsland and S. Wessel, Physical Review B , 174435 (2018).[15] P. Huembeli, A. Dauphin and P. Wittek, Physical Review B , 134109 (2018).[16] N. Yoshioka, Y. Akagi and H. Katsura, Physical Review B , 205110 (2018).[17] L. Wang, Phys. Rev. B , 195105 (2016).[18] W. Hu, R. R. P. Singh and R. T. Scalettar, Physical Review E , 062122 (2017).[19] P. Ponte and R. G. Melko, Physical Review B , 205146 (2017).[20] J. F. Rodriguez-Nieva and M. S. Scheurer, Nature Physics , 790 (2019).[21] J. Wang, W. Zhang, T. Hua and T.-C. Wei, Unsupervised learning of topological phase transitions using calinski-harabazindex, 2020, [arXiv:2010.06136].[22] Y. Long, J. Ren and H. Chen, Physical Review Letters , 185501 (2020).[23] Y. Yang, Z.-Z. Sun, S.-J. Ran and G. Su, Visualizing quantum phases and identifying quantum phase transitions bynonlinear dimensionality reduction, 2020, [arXiv:2006.08461].[24] C.-K. Chiu, J. C. Y. Teo, A. P. Schnyder and S. Ryu, Rev. Mod. Phys. , 035005 (2016).[25] Y. Che, C. Gneiting, T. Liu and F. Nori, Physical Review B , 134213 (2020).[26] L.-F. Zhang et al. , Machine learning topological invariants of non-hermitian systems, 2020, [arXiv:2009.04058].[27] B. Narayan and A. Narayan, Machine learning non-hermitian topological phases, 2020, [arXiv:2009.06476].[28] A. Kerr, G. Jose, C. Riggert and K. Mullen, Automatic learning of topological phase boundaries, 2020, [arXiv:2010.13236].[29] L.-W. Yu and D.-L. Deng, Unsupervised learning of non-hermitian topological phases, 2020, [arXiv:2010.14516].[30] N. Käming et al. , Unsupervised machine learning of topological phase transitions from experimental data, 2021,[arXiv:2101.05712].[31] W. P. Su, J. R. Schrieﬀer and A. J. Heeger, Phys. Rev. Lett. , 1698 (1979).[32] W. P. Su, J. R. Schrieﬀer and A. J. Heeger, Phys. Rev. B , 2099 (1980).[33] J. K. Asbóth, L. Oroszlány and A. Pályi, Lecture Notes in Physics (2016).[34] N. Batra and G. Sheet, Resonance , 765 (2020).[35] H.-C. Hsu and T.-W. Chen, Physical Review B , 205425 (2020).[36] L. Li, Z. Xu and S. Chen, Physical Review B , 085111 (2014).[37] C. Yin, H. Jiang, L. Li, R. Lü and S. Chen, Physical Review A , 052115 (2018).[38] L. van der Maaten and G. Hinton, Journal of Machine Learning Research , 2579 (2008).[39] F. Pedregosa et al. , Journal of Machine Learning Research , 2825 (2011). [40] https://scikit-learn.org/stable/modules/manifold.html .[41] https://scikit-learn.org/stable/modules/clustering.html .[42] E. Schubert, J. Sander, Martin, H.-P. Kriegel and X. Xu, ACM Transactions on Database Systems42