[PDF] SDSS DR7 superclusters. Principal component analysis

Abstract

We apply the principal component analysis and Spearman's correlation test to study the properties of superclusters drawn from the SDSS DR7. We analyse possible selection effects in the supercluster catalogue, study the physical and morphological properties of superclusters, find their possible subsets, and determine scaling relations for superclusters. We show that the parameters of superclusters do not correlate with their distance. The correlations between the physical and morphological properties of superclusters are strong. Superclusters can be divided into two populations according to their total luminosity. High-luminosity superclusters form two sets, more elongated systems with the shape parameter K_1/K_2 < 0.5 and less elongated ones with K_1/K_2 > 0.5. The first two principal components account for more than 90% of the variance in the supercluster parameters and define the fundamental plane, which characterises the physical and morphological properties of superclusters. We use principal component analysis to derive scaling relations for superclusters, in which we combine the physical and morphological parameters of superclusters. Structure formation simulations for different cosmologies, and more data about the local and high redshift superclusters are needed to understand better the evolution and the properties of superclusters.

Full PDF

aa r X i v : . [ a s t r o - ph . C O ] A ug Astronomy&Astrophysicsmanuscript no. AA17529 c (cid:13)

ESO 2011August 23, 2011

SDSS DR7 superclusters

Principal component analysis

M. Einasto , L.J. Liivam¨agi , , E. Saar , , J. Einasto , , , E. Tempel , E. Tago , and V.J. Mart´ınez Tartu Observatory, 61602 T˜oravere, Estonia Institute of Physics, Tartu University, T¨ahe 4, 51010 Tartu, Estonia Estonian Academy of Sciences, EE-10130 Tallinn, Estonia ICRANet, Piazza della Repubblica 10, 65122 Pescara, Italy Observatori Astron`omic, Universitat de Val`encia, Apartat de Correus 22085, E-46071 Val`encia, SpainReceived ... / Accepted ...

ABSTRACT

Context.

The study of superclusters of galaxies helps us to understand the formation, evolution, and present-day properties of thelarge-scale structure of the Universe.

Aims.

We use data about superclusters drawn from the SDSS DR7 to analyse possible selection e ﬀ ects in the supercluster catalogue,to study the physical and morphological properties of superclusters, to ﬁnd their possible subsets, and to determine scaling relationsfor our superclusters. Methods.

We apply principal component analysis and Spearman’s correlation test to study the properties of superclusters.

Results.

We have found that the parameters of superclusters do not correlate with their distance. The correlations between the physicaland morphological properties of superclusters are strong. Superclusters can be divided into two populations according to their totalluminosity: high-luminosity ones with L g >

400 10 h − L ⊙ , and low-luminosity systems. High-luminosity superclusters form twosets, which are more elongated systems with the shape parameter K / K < . K / K > .

5. The ﬁrst twoprincipal components account for more than 90% of the variance in the supercluster parameters. We use principal component analysisto derive scaling relations for superclusters, in which we combine the physical and morphological parameters of superclusters.

Conclusions.

The ﬁrst two principal components deﬁne the fundamental plane, which characterises the physical and morphologicalproperties of superclusters. Structure formation simulations for di ﬀ erent cosmologies, and more data about the local and high redshiftsuperclusters are needed to understand the evolution and the properties of superclusters better. Key words. cosmology: observations – cosmology: large-scale structure of the Universe; clusters of galaxies

1. Introduction

The large-scale distribution of the dark and baryonic matter inthe Universe can be described as the cosmic web – the net-work of galaxies, groups, and clusters of galaxies connectedby ﬁlaments (Joeveer et al. 1978; Gregory & Thompson 1978;Zeldovich et al. 1982; de Lapparent et al. 1986). In this net-work superclusters are the largest density enhancements formedby the density perturbations on a scale of about 100 h − Mpc( H = h km s − Mpc − ). Numerical simulations show thathigh-density peaks in the density distribution (the seeds of su-percluster cores) are seen already at very early stages of the for-mation and evolution of structure (Einasto 2010). These are thelocations of the formation of the ﬁrst objects in the Universe(e.g. Venemans et al. 2004; Mobasher et al. 2005; Ouchi et al.2005; Hatch et al. 2011). Studying the properties of superclus-ters helps us to understand the formation, evolution, and proper-ties of the large-scale structure of the Universe (Ho ﬀ man et al.2007; Araya-Melo et al. 2009a; Bond et al. 2010, and referencestherein). Comparison of observed and simulated superclusters,especially extreme systems among them, is a test of cosmo-logical models (Kolokotronis et al. 2002; Einasto et al. 2007a,e;Araya-Melo et al. 2009a; Einasto et al. 2011b; Sheth & Diaferio2011). Send o ﬀ print requests to : M. Einasto The ﬁrst step in supercluster studies is to compile su-percluster catalogues, which serve as observational databases.Supercluster catalogues have been constructed using the friend-of-friend method or using a smoothed density ﬁeld of galax-ies. The ﬁrst method has been applied to the data on rich(Abell) clusters of galaxies to obtain catalogues of superclus-ters of rich clusters, both from observations and simulations(Zucca et al. 1993; Einasto et al. 1994; Kalinkov & Kuneva1995; Einasto et al. 1997, 2001; Wray et al. 2006). Densityﬁeld superclusters have been determined using data of deepsurveys of galaxies (Basilakos 2003; Einasto et al. 2003a;Erdo˘gdu et al. 2004; Einasto et al. 2006, 2007b; Liivam¨agi et al.2010; Costa-Duarte et al. 2011; Luparello et al. 2011). Theproperties of superclusters have been studied, for ex-ample, by Jaaniste et al. (1998), Kolokotronis et al. (2002),Costa-Duarte et al. (2011), Luparello et al. (2011), Wray et al.(2006), and Einasto et al. (2001, 2007a,c,e, 2011a). These stud-ies show that the properties of superclusters are correlated.More luminous superclusters are richer and larger, contain richergalaxy clusters, and have higher maximum densities of galaxiesthan less luminous systems. High-luminosity superclusters aremore elongated and have more complicated inner structure thanlow-luminosity ones.In the present paper we use the Spearman’s correlation testand the principal component analysis (PCA), an excellent tool

1. Einasto et al.: PCA for multivariate data analysis, to investigate how strong the cor-relations between the properties of superclusters are. Our goalsare to analyse the presence of possible distance-dependent se-lection e ﬀ ects in the supercluster catalogue, to study the cor-relations between the physical and morphological properties ofsuperclusters, to ﬁnd the possible subsets and outliers of super-clusters, and to determine the scaling relations for the superclus-ters.Principal component analysis have been used in astron-omy for a number of purposes: the study of the prop-erties of stars (Tiit & Einasto 1964; Deeming 1964), spec-tral classiﬁcation of galaxies (S´anchez Almeida et al. 2010,and references therein), morphological classiﬁcation of galax-ies (Coppa et al. 2010), studies of galaxies, galaxy groups,and dark matter haloes (Efstathiou & Fall 1984; Lanzoni et al.2004; Ferreras et al. 2006; Woo et al. 2008; Chang et al. 2010;Ishida & de Souza 2011; Toribio et al. 2011; Skibba & Maccio’2011; Jeeson-Daniel et al. 2011, and references therein), for theHubble parameter reconstruction (Ishida & de Souza 2011, andreferences therein), and for studies of star formation history inthe universe using gamma ray bursts (Ishida et al. 2011). Ourstudy is the ﬁrst in which the PCA is applied to explore the prop-erties of superclusters of galaxies.In Sect. 2 we give data about superclusters. In Sect. 3 we de-scribe the PCA and the Spearman’s correlation test, and applythem in Sect. 4 to study the physical and morphological prop-erties of superclusters and to derive scaling relations for the su-perclusters. We discuss selection e ﬀ ects in Sect. 5 and give ourconclusions in Sect. 6.We assume the standard cosmological parameters: theHubble parameter H = h km s − Mpc − , the matter den-sity Ω m = .

27, and the dark energy density Ω Λ = .

2. Data

We selected the MAIN galaxy sample of the 7th data release ofthe Sloan Digital Sky Survey (Adelman-McCarthy et al. 2008;Abazajian et al. 2009) with the apparent r magnitudes 12 . ≤ r ≤ .

77, excluding duplicate entries. The sample is describedin detail in Tago et al. (2010), hereafter T10. We corrected theredshifts of galaxies for the motion relative to the CMB andcomputed the co-moving distances (Mart´ınez & Saar 2002) ofgalaxies.We calculated the galaxy luminosity density ﬁeld to recon-struct the underlying mass distribution. To determine superclus-ters (extended systems of galaxies) in the luminosity densityﬁeld we created a set of density contours by choosing a densitythreshold and deﬁne connected volumes above a certain densitythreshold as superclusters. In order to choose proper density lev-els to determine individual superclusters, we analysed the den-sity ﬁeld superclusters at a series of density levels. As a result weused the density level D = . ℓ mean = · − h − L ⊙ ( h − Mpc) )to determine individual superclusters. At this density level su-perclusters in the richest chains of superclusters in the volumeunder study still form separate systems; at lower density levelsthey join into huge percolating systems. At higher threshold den-sity levels superclusters are smaller and their number decreases.In our ﬂux-limited catalogue the luminosity-dependent se-lection e ﬀ ects are the smallest at the distance interval 90 h − Mpc ≤ D com ≤ h − Mpc. For the present study we chose su-perclusters of galaxies in this distance interval. There are 125superclusters in the sample. Even the poorest systems in our sample contain several groups of galaxies. These systems canbe compared with the Local supercluster containing one clus-ter of galaxies with outgoing ﬁlaments. In the Appendix A wegive the details of the calculations of galaxy luminosities andof the luminosity density ﬁeld, as well as of the selection ef-fects. The description of the supercluster catalogues is given inLiivam¨agi et al. (2010, hereafter L10). The superclusters can be characterised by the followingphysical parameters: the total weighted luminosity of galaxies ina supercluster, L g , the volume Volume , the diameter

Diameter ,and the number of galaxies in superclusters, N gal. The super-cluster volume is calculated from the density ﬁeld as the numberof connected grid cells multiplied by the cell volume: Volume = N scl ∆ , (1)where ∆ is the grid cell length.The total luminosity of the superclusters L g is calculated asthe sum of weighted galaxy luminosities: L g = X gal ∈ scl W L ( d gal ) L gal . (2)Here the W L ( d gal ) is the distance-dependent weight of a galaxy(the ratio of the expected total luminosity to the luminositywithin the visibility window). We describe the calculation ofweights in Appendix A. The diameter of a supercluster is deﬁnedas the maximum distance between its galaxies. The distance ofa supercluster is the distance to it’s density maximum. The peakdensity D peak is that of the highest density peak within the su-percluster. Usually the highest values of densities coincide withthe richest cluster of galaxies in a supercluster. For details werefer to L10.The overall morphology of a supercluster is described bythe shapeﬁnders K (planarity) and K (ﬁlamentarity), and theirratio, K / K (the shape parameter). The shapeﬁnders are cal-culated using the volume, area, and integrated mean curvatureof a supercluster; they contain information both about the sizesof superclusters and about their outer shape. Systems with dif-ferent shapes and similar sizes have di ﬀ erent shape parameters(Einasto et al. 2008). For the ﬁrst time the shapeﬁnders were ap-plied in the studies of galaxy systems by Basilakos et al. (2001)who analysed the shapes of the PSCz superclusters. We usethe maximum value of the fourth Minkowski functional V (theclumpiness) to characterise the inner structure of the superclus-ters. The larger the value of V , the more complicated the innermorphology of a supercluster is; superclusters may be clumpy,and they also may have holes or tunnels in them (Einasto et al.2007e, 2011b).The formulae for the Minkowski functionals andshapeﬁnders are given in App.B.The large-scale distribution of superclusters is shown inFig. 1 in cartesian coordinates. These coordinates are deﬁnedas in Park et al. (2007) and in Liivam¨agi et al. (2010): x = − d sin λ, y = d cos λ cos η, z = d cos λ sin η, (3)where d is the comoving distance, and λ and η are the SDSS sur-vey coordinates. Einasto et al. (2011a) gave detailed descriptionof the large-scale distribution of rich superclusters. The supercluster catalogues can be downloaded from: http://atmos.physic.ut.ee/˜juhan/super/ .2. Einasto et al.: PCA x y −300 −200 −100 0 100 200 300100200300 x z −300 −200 −100 0 100 200 300−200−1000100200 Fig. 1.

The distribution of superclusters in cartesian coordinates, in units of h − Mpc. The ﬁlled circles denote superclusters with theluminosity L g >

400 10 h − L ⊙ , empty circles denote less luminous superclusters. The numbers are ID’s of luminous superclusterfrom L10 (Table C.1). Lg d −3.0 −1.0 1.0 3.00.00.10.20.30.4 Volume d −2.0 0.0 2.00.00.10.20.3 Diameter d −2.0 0.0 2.0 4.00.00.10.20.30.4 D(peak) d −2.0 0.0 2.0 4.00.00.10.20.30.4 N(gal) d −3.0 −1.0 1.0 3.00.00.10.20.30.4 Fig. 2.

Distribution of the standardised physical parameters of superclusters. From left to right: the total weighted luminosity ofgalaxies L g , the volume and the diameter of superclusters, the density of the highest density peak inside superclusters, D peak, andthe number of galaxies in superclusters, N gal.

3. Principal component analysis

The idea of the principal component analysis is to ﬁnd a smallnumber of linear combinations of correlated parameters to de-scribe most of the variation in the dataset with a small numberof new uncorrelated parameters. The PCA transforms the datato a new coordinate system, where the greatest variance by anyprojection of the data lies along the ﬁrst coordinate (the ﬁrst prin-cipal component), the second greatest variance – along the sec-ond coordinate, and so on. There are as many principal compo-nents as there are parameters, but typically only the ﬁrst few areneeded to explain most of the total variation.Principal components PC x ( x ∈ N , x ≤ N tot ) are a linearcombination of the original parameters: PCx = N tot X i = a ( i ) x V i (4)where − ≤ a ( i ) x ≤ ﬃ cients of the linear transfor-mation, V i are the original parameters and N tot is the number ofthe original parameters.PCA is suitable tool to study simultaneously correlations be-tween a large number of parameters, for ﬁnding subsets in data,and detecting outliers. Linear combinations of principal compo-nents can be used to reproduce parameters characterising objectsin the dataset.Principal components can be used to derive scaling relations.If data points lie along a plane, deﬁned by the ﬁrst two principalcomponents, then the scaling relations along this plane are de-ﬁned by the third principal component (Efstathiou & Fall 1984). For the analysis we use standardised parameters, centred on theirmeans ( V i − V i ) and normalised (divided by their standard devi-ations, σ ( V i )). Therefore we obtain for the scaling relations: N tot X i = a ( i ) ( V i − V i ) σ ( V i ) = . (5)For PCA, the parameters should be normally distributed.Therefore we use the logarithms of parameters in most cases;this makes the distributions more gaussian, and the range overwhich their values span are smaller, especially for luminositiesand volumes. We do not use logarithms of morphological data,in order to not to exclude from the analysis those with negativevalues of shapeﬁnders, which may occur in the case of compactsuperclusters with a complex overall morphology (Einasto et al.2008, 2011b). Figures 2 and 3 show the distribution of the val-ues of the standardised parameters. Deviations from the nor-mal distribution are mostly caused by the most luminous (or bythe poorest for the shape parameter) superclusters in our sam-ple. In Table 1 we give the mean values and standard devia-tions of supercluster parameters. For poor superclusters of “spi-der” morphology the shape parameter is not always well deﬁned(Einasto et al. 2011a). For ﬁve systems the value of the shape pa-rameter | K / K | >

4; therefore we also calculated the mean valueand standard deviation of the shape parameter without these sys-tems (denoted as K / K ∗ ). This e ﬀ ect does not a ﬀ ect the valuesof other parameters, thus we did not exclude these systems fromour calculations.

3. Einasto et al.: PCA

We present in tables the values of principal components andthe standard deviations, proportion of variance, and cumula-tive variance of principal components. The values of compo-nents show the importance of the original parameters in eachPCx. We plot the principal planes for superclusters. For the cal-culations we used command prcomp from R , an open-sourcefree statistical environment developed under the GNU GPL(Ihaka & Gentleman 1996, ).To study correlations between properties of superclusters, weapplied Spearman’s rank correlation test, in which the value ofthe correlation coe ﬃ cient r shows the presence of correlation( r = r = − r ≈ Table 1.

Mean values and standard deviations of superclusterparameters. (1) (2) (3)Parameter mean sd log( L g ) 2.367 0.378log( Volume ) 2.813 0.571log(

Diameter ) 1.179 0.258log( D peak) 0.856 0.119log( N gal) 2.219 0.435log( Dist . ) 2.379 0.113 V K K K / K -0.050 3.701 K / K ∗ Notes. L g – the total weighted luminosity of galaxies in superclustersin units of 10 h − L ⊙ ; Volume – in units of ( h − Mpc) ; Diameter –in Mpc / h ; N gal – the number of galaxies in superclusters; D peak – thedensity of the highest density peak inside superclusters, in units of meandensity; Dist – the distance in Mpc / h ; V is the maximum value of thefourth Minkowski functional, K is the planarity, K is the ﬁlamentar-ity, and the ratio, K / K , is the shape parameter of superclusters (seeSection 2 for deﬁnitions). K / K ∗ denotes the shape parameter for thesupercluster sample from which we excluded ﬁve most noisy values asexplained in the text. V3 d −2.0 0.0 2.0 4.00.00.20.40.6 K1 d −2.0 0.0 2.0 4.00.00.20.40.6 K2 d −2.0 0.0 2.0 4.00.00.20.40.60.8 K1/K2* d −2.0 0.0 2.00.10.20.30.40.5 Fig. 3.

Distribution of the standardised morphological parame-ters of superclusters. From left to right: the maximum value ofthe fourth Minkowski functional V , the planarity K , the ﬁla-mentarity K , and the shape parameter of superclusters, K / K ∗ .

4. Results

We start the calculations of principal components using physicalcharacteristics of superclusters and their distances. Including thesupercluster distances may show possible correlations between the other parameters of superclusters and their distance, whichwill indicate that the parameters of superclusters are a ﬀ ected bydistance-dependent selection e ﬀ ects. Table 2.

Results of the principal component analysis, with thedistances of superclusters included. (1) (2) (3) (4)PC1 PC2 PC3log( N gal) -0.444 0.264 -0.108log( L g ) -0.455 -0.149 -0.097log( Diameter ) -0.441 -0.133 -0.542log(

Volume ) -0.454 -0.126 -0.042log( D peak) -0.427 -0.062 0.825log( Distance ) 0.100 -0.932 0.012Importance of components PC1 PC2 PC3Standard deviation 2.148 1.046 0.466Proportion of Variance 0.769 0.182 0.036Cumulative Proportion 0.769 0.951 0.987

Notes.

Notations given in Section 2.

Table 3.

Results of the Spearman’s rank correlation test. (1) (2) (3)Parameters r p log(

Dist . ) vs. log( L g ) -0.06 0.50log( Dist . ) vs. log( N gal) -0.49 9 . e − Dist . ) vs. log( Diameter ) -0.11 0.20log(

Dist . ) vs. log( Volume ) -0.08 0.40log(

Dist . ) vs. log( D peak) -0.09 0.33log( Dist . ) vs. V -0.03 0.78log( Dist . ) vs. K -0.08 0.37log( Dist . ) vs. K Dist . ) vs. K / K -0.05 0.58log( L g ) vs. log( N gal) 0.88 < . e − L g ) vs. log( Diameter ) 0.95 < . e − L g ) vs. log( Volume ) 0.98 < . e − L g ) vs. log( D peak) 0.94 < . e − L g ) vs. V < . e − L g ) vs. K < . e − L g ) vs. K < . e − L g ) vs. K / K Notes.

Rank correlation coe ﬃ cient r and the p-value p . The values p < .

05 mean that the results are statistically of very high signiﬁcance.

Table 2 presents the results of this analysis. We show thevalues of only the ﬁrst three principal components, enough forthis test. The coe ﬃ cients of the ﬁrst principal component of thephysical parameters are of almost equal value, while the coe ﬃ -cient corresponding to the distance is very small – the ﬁrst prin-cipal component accounts for most of the variance of the physi-cal parameters of superclusters. The second principal componentaccounts for most of the variance of the distances of superclus-ters. This shows that the physical parameters of superclusters arenot correlated with distance. To ensure that this interpretationis correct we carried out the Spearman’s tests for correlations

4. Einasto et al.: PCA (Table 3). These tests showed a weak anticorrelation betweenthe distance and the number of galaxies in superclusters, with ahigh statistical signiﬁcance. This is not surprising since the cat-alogue of superclusters is based on the ﬂux-limited sample inwhich the number of galaxies in superclusters depends on thedistance. The sample of superclusters was chosen from a rela-tively narrow distance interval, so this dependence is weak. Forother parameters of superclusters (luminosity, diameter, volume,and peak density), the tests showed a very weak correlation withdistance (Spearman’s rank r ≈ . p -values show. Therefore we conclude thatthere are no correlations between the distances and physical pa-rameters of superclusters, and the distance-dependent selectione ﬀ ects have been properly taken into account when generatingthe supercluster catalogue and calculating the physical proper-ties of superclusters. Table 4.

Results of the principal component analysis for thephysical parameters. (1) (2) (3) (4) (5) (6)PC1 PC2 PC3 PC4 PC5log( N gal) -0.439 0.056 0.895 -0.036 -0.018log( L g ) -0.460 0.112 -0.217 -0.047 0.851log( Diameter ) -0.445 0.557 -0.238 0.561 -0.344log(

Volume ) -0.458 0.058 -0.268 -0.761 -0.367log( D peak) -0.430 -0.818 -0.149 0.319 -0.144Importance of componentsPC1 PC2 PC3 PC4 PC5St. deviation 2.139 0.467 0.377 0.193 0.161Prop. Variance 0.915 0.043 0.028 0.007 0.005Cum. Proportion 0.915 0.958 0.987 0.994 1.000 Notes.

Notations as in Table 2.

We will proceed with the analysis of superclusters, tak-ing only the physical parameters into account. Table 4, whichpresents the results of this analysis, demonstrates that the co-e ﬃ cients of the ﬁrst principal component are almost equal fordi ﬀ erent parameters of superclusters. Therefore the parameters,which describe the full supercluster (the luminosity, richness,diameter, and volume), are almost equally important in deter-mining the supercluster properties. The cumulative variance inTable 4 shows that the ﬁrst two principal components accountfor more than 95% of the total variance in this supercluster sam-ple. The ﬁrst principal component accounts for most of the vari-ance of the overall parameters of superclusters. The values ofthe second principal component show that the largest remainingvariance in the sample comes from the peak density of super-clusters. The values of the third principal component show thatthe coe ﬃ cients corresponding to the luminosity, volume, and di-ameter have almost equal negative values, while the number ofgalaxies has large positive coe ﬃ cients.The PCA therefore suggests that the physical parameters ofsuperclusters are strongly correlated. We checked for the pres-ence of the correlations between the parameters with Spearman’stests, which showed that the correlations between the parametersof superclusters are statistically of very high signiﬁcance, bothbetween the overall parameters of superclusters and between theoverall parameters and the peak density inside the superclusters(Table 3). We only present the correlations between the lumi-nosity and other parameters, to keep Table 3 short. The results of the tests of other correlations are similar. Especially tight arethe correlations between the luminosities, the diameters, and thevolumes of superclusters, as the correlation coe ﬃ cients show inTable 3. −5.0−2.50.02.5 −5.0 −2.5 0.0 2.5−5.0 −2.5 0.0 2.5−5.0−2.50.02.5 PC2 PC3PC1PC3

Fig. 4.

Principal planes for superclusters, PCA with physical pa-rameters. Open circles: high-luminosity superclusters with lu-minosity L g >

400 10 h − L ⊙ , grey dots: superclusters of lowerluminosity.Let us take a look at the locations of superclusters in the prin-cipal planes (Fig. 4). The upper lefthand panel shows the distri-bution of superclusters in the principal plane PC1-PC2. Most su-perclusters form here an elongated cloud with a very small scat-ter. These are low-luminosity superclusters with the luminosity L g <

400 10 h − L ⊙ . The scatter of positions of high-luminositysuperclusters is larger. This suggests that we can divide super-clusters into two populations according to their total luminosity.The transition between populations is smooth. We give the dataabout high-luminosity superclusters in Table C.1. The luminoussuperclusters with a high value of the peak density have highernegative values for the second PC, and the supercluster SCl 001has the largest negative value of PC2. The superclusters witha lower value of the peak density have positive values of thesecond PC. The richest supercluster in the sample, SCl 061, isamong them. This supercluster has the highest negative value ofPC1. The lefthand panels of Figure 4 show that the more lumi-nous the supercluster, the higher is the negative value of it’s ﬁrstprincipal component. The value of the peak density inside super-clusters determines the location of superclusters along the axisof the second principal component. In PC1-PC3 plane (lowerleft panel of Fig. 4) superclusters also form an elongated cloudwith larger scatter of high-luminosity superclusters. Upper rightpanel (PC3-PC2 plane) shows the third view of this cloud. Suchan elongated, prolate shape is characteristic of the planar distri-bution on PC1-PC2 plane (Woo et al. 2008), which deﬁnes thefundamental plane for superclusters.

5. Einasto et al.: PCA

Table 5.

Results of principal component analysis for the lumi-nosity and morphological properties of superclusters. (1) (2) (3) (4) (5) (6)PC1 PC2 PC3 PC4 PC5log( L g ) -0.489 0.004 -0.655 0.373 -0.437 V -0.490 -0.044 0.596 0.608 0.173 K -0.511 -0.023 -0.297 -0.331 0.734 K -0.505 -0.056 0.351 -0.615 -0.488 K / K Notes.

As in Section 2.

Next, we use the PCA to study the morphological and physi-cal properties of superclusters simultaneously. From the phys-ical characteristics we only include the total luminosity, whichis su ﬃ cient since the physical parameters of superclusters arestrongly correlated. Table 3 shows that the mophological pa-rameters of superclusters are not correlated with their distances.Table 5 shows the results of PCA for the luminosity and themorphological parameters. Here the absolute values of compo-nents for the luminosity, the clumpiness, and the shapeﬁnders K and K are almost equal. Therefore the luminosity and thesemorphological parameters are equally important in shaping theproperties of superclusters. The second principal component ac-counts for most of the variance of the shape parameter K / K .The higher the negative value of the PC1 for the supercluster,the more luminous the supercluster, has higher value planari-ties and ﬁlamentarities, and higher maximal value of the fourthMinkowski functional V , hence a richer inner morphology.Table 5 shows that the ﬁrst two principal components ac-count for about 93% of the total variance in the data.The Spearman’s tests (Table 3) showed that the correlationsbetween the supercluster luminosity and its morphological pa-rameters are statistically highly signiﬁcant. The correlation be-tween the luminosity and the shape parameter of superclusters isweak.Figure 5 presents the locations of superclusters in the prin-cipal planes, deﬁned by the luminosity and morphological pa-rameters of superclusters. The upper lefthand panel shows thedistribution of superclusters in the principal plane PC1-PC2.Here both high- and low-luminosity superclusters form an elon-gated cloud with very small scatter. The scatter of positions ofthe high-luminosity superclusters in PC1-PC3 plane is greater.Again, the more luminous the supercluster, the higher the nega-tive value of its ﬁrst principal component. High values of PC1(and the highest values of PC3) correspond to luminous su-perclusters with high values of clumpiness V (Table 5). Largescatter along the second principal component PC2 in princi-pal planes correspond to superclusters with high values of theshape parameter K / K . These are poor superclusters of “spider”morphology, for which the shape parameter is not well deﬁned(Einasto et al. 2011a). We see that the luminosity and the mor-phological parameters of superclusters also deﬁne a fundamentalplane for superclusters, where the physical and morphologicalproperties are combined. −10.0−7.5−5.0−2.50.02.55.0 −10.0 −7.5 −5.0 −2.5 0.0 2.5 5.0−10.0 −7.5 −5.0 −2.5 0.0 2.5 5.0−10.0−7.5−5.0−2.50.02.55.0 PC2 PC3PC1PC3

Fig. 5.

Principal planes for superclusters. PCA with the morpho-logical parameters. Open circles: high-luminosity superclusterswith luminosity L g >

400 10 h − L ⊙ ; grey dots: superclustersof lower luminosity. Table 6.

Results of principal component analysis for luminosity,diameters, and shapeﬁnders. (1) (2) (3) (4)PC1 PC2 PC3log( L g ) -0.5713 0.7905 -0.2205 K D -0.5833 -0.2020 0.7867 K D -0.5773 -0.5781 -0.5765Importance of componentsPC1 PC2 PC3St.deviation 1.696 0.308 0.165Prop.Variance 0.959 0.031 0.009Cum.Proportion 0.959 0.990 1.000 Notes. log( L g ): logarithm of the total luminosity of superclusters, K D = (1 − K ) · log( Diameter ), and K D = (1 − K ) · log( Diameter ). The results of the PCA suggest that the ﬁrst two princi-pal components deﬁne the fundamental plane for superclus-ters. This motivates us to ﬁnd the scaling relations betweenthe supercluster parameters. The scaling relations have ear-lier been found between the properties of galaxies, of groupsof galaxies and of dark matter haloes (Faber & Jackson 1976;Tully & Fisher 1977; Kormendy 1977; Efstathiou & Fall 1984;Djorgovski & Davis 1987; Dressler et al. 1987; Schae ﬀ er et al.1993; Adami et al. 1998; Lanzoni et al. 2004; D’Onofrio et al.2008; Woo et al. 2008; Araya-Melo et al. 2009b, and referencestherein).For scaling relations we use Eq. (5) and perform the PCAfor the parameters log( L g ), (1 − K ) · log( Diameter ) and (1 − K ) · log( Diameter ). This set combines the easily detectable di-ameter of superclusters, and morphological parameters K and K , which characterise the sizes and the shapes of superclus-

6. Einasto et al.: PCA −2.50.02.5 −2.5 0.0 2.5−2.5 0.0 2.5−2.50.02.5

PC2PC3 PC3PC1

Fig. 6.

Principal planes for superclusters. PCA for the luminos-ity, diameter, and shapeﬁnders as described in the text. Open cir-cles: high-luminosity superclusters with luminosity L g > h − L ⊙ , grey dots: superclusters of lower luminosity.ters, with the total luminosity of superclusters. For low values ofshapeﬁnders, (1 − K ) and (1 − K ) are less noisy than K and K (Einasto et al. 2011a). log Lg(predicted) l og Lg ( ob s e r v ed ) Fig. 7. L g (observed) vs. L g (predicted) , in units of 10 h − L ⊙ . Opencircles denote high-luminosity superclusters with the luminosity L g >

400 10 h − L ⊙ and the shape parameter K / K > . K / K < . L g ) = (5 . K − . K − . · log( D ) + . , (6)where D denotes diameter. The standard deviation for the rela-tion sd = . sd = . sd = . ﬀ erent symbols, according to their shape parameter. Figure 7shows that more elongated and less elongated high-luminositysuperclusters populate the L g (observed) - L g (predicted) plane di ﬀ er-ently. This suggests that luminous superclusters can be dividedinto two populations according to their shapes. Our calculationsshow that there is no such di ﬀ erence for low-luminosity super-clusters. The di ﬀ erences between the observed and predicted lu-minosity are the largest for ﬁve systems with the highest pre-dicted luminosity in Fig. 7. These are very elongated luminoussuperclusters in the sample, systems of (multibranching) ﬁla-ment morphology, SCl 064, SCl 189, SCl 336, and SCl 474,and a multispider SCl 530 (for morphological classiﬁcation ofsuperclusters we refer to Einasto et al. 2011a).Next we derived the scaling relations separately for moreelongated and less elongated high-luminosity superclusters (cor-respondingly, Eq. (7) and Eq. (8)), and for all low-luminositysuperclusters (Eq. (9)):log( L g ) = (0 . K − . K + . · log( D ) + .

69 (7)log( L g ) = (3 . K − . K + . · log( D ) + .

09 (8)log( L g ) = (63 . K − . K − . · log( D ) + .

81 (9)Figure 8 demonstrates the observed vs. predicted luminos-ity of superclusters found with these relations. Now luminositiesof high-luminosity superclusters are recovered well, with a verysmall scatter ( sd = .

16 and sd = .

22 for more elongated andless elongated superclusters). Interestingly, this ﬁgure shows theabsence of the correlation between the observed and predictedluminosity for low-luminosity superclusters. To understand this,we plot in Fig. 9 the shapeﬁnders K − K plane for superclus-ters where the size of symbols is proportional to the diame-ters of superclusters. Here the values of shapeﬁnders for high-luminosity superclusters are correlated, and these superclustersalso have larger sizes. Most low-luminosity superclusters havevery low, uncorrelated values of shapeﬁnders (both K and K < .

7. Einasto et al.: PCA log Lg(predicted) l og Lg ( ob s e r v ed ) Fig. 8. L g (observed) vs. L g (predicted) , in units of 10 h − L ⊙ . Opencircles denote high-luminosity superclusters with the luminosity L g >

400 10 h − L ⊙ and the shape parameter K / K > . L g >

400 10 h − L ⊙ and the shapeparameter K / K < . K1 K Fig. 9.

Shapeﬁnders K − K plane for superclusters. The size ofsymbols is proportional to the diameters of superclusters. Opencircles denote high-luminosity superclusters with the luminos-ity L g >

400 10 h − L ⊙ and the shape parameter K / K > . L g >

400 10 h − L ⊙ and theshape parameter K / K < . K and K > .

5. Selection effects

The main selection e ﬀ ect in our study comes from the use ofa ﬂux-limited sample of galaxies to determine the luminositydensity ﬁeld and superclusters. To have luminosity-dependentselection e ﬀ ects as small as possible, we used data about galaxiesand galaxy systems from a distance interval 90 – 320 h − Mpc,in which these e ﬀ ects are the least (we refer to T10 for details).We showed above that the parameters of superclusters (exceptthe number of galaxies) do not correlate with distance, whichshows that the distant-dependent selection e ﬀ ects are correctlytaken into account when generating the supercluster catalogue.If the number of cells used to deﬁne superclusters is toosmall then the supercluster catalogue may include objects thatcannot be considered as real superclusters. Moreover, the de-tection of the shape parameter becomes unreliable. If the shapeparameter is determined using the inertia tensor method thensuperclusters have to be deﬁned using at least eight members(Kolokotronis et al. 2001). In our study we determine shapeﬁnd-ers with Minkowski functionals, and the minimum number ofcells for deﬁning superclusters is 64 (Appendix A). We anal-ysed systems in a distance interval where the selection e ﬀ ectsare small. Even the poorest systems contain at least 25 to 30galaxies and several groups of galaxies. Therefore the detec-tion of the shape parameter may only be a ﬀ ected weakly bythe selection e ﬀ ects except for the poorest systems of “spider”morphology for which the shapeﬁnders may be noisy. We notethat Costa-Duarte et al. (2011) include systems with at least tenmember galaxies in their supercluster catalogue to study of theshape parameter of superclusters.Another selection e ﬀ ect comes from the choice of the thresh-old density to determine superclusters. At the density level usedin the present paper ( D = . Table 7.

Results of the principal component analysis for thethreshold density level D = . (1) (2) (3) (4) (5) (6)PC1 PC2 PC3 PC4 PC5log N gal -0.437 0.085 0.889 -0.082 0.052log L g -0.460 0.093 -0.146 0.854 -0.166log Diameter -0.447 0.523 -0.282 -0.443 -0.498log

Volume -0.461 0.094 -0.298 -0.150 0.816log D peak -0.428 -0.837 -0.136 -0.208 -0.233Importance of componentsPC1 PC2 PC3 PC4 PC5St.deviation 2.127 0.484 0.406 0.214 0.172Prop.Variance 0.904 0.046 0.033 0.009 0.005Cum.Proportion 0.904 0.951 0.984 0.994 1.000 Notes.

Notations as in Table 2.

To see the sensitivity of the PCA results to the small di ﬀ er-ences in the choice of the threshold density, we compared theresults of the PCA for superclusters chosen at higher and lowerthreshold density levels. As an example we show in Table 7the coe ﬃ cients of the principal components for the superclus-ters chosen at the threshold density level D = .

5. At this den-

8. Einasto et al.: PCA

Table 8.

Results of the Spearman’s rank correlation test for thethreshold density level D = . (1) (2) (3)Parameters r p log( L g ) vs. log( N gal) 0.85 < . e − L g ) vs. log( Diameter ) 0.94 < . e − L g ) vs. log( Volume ) 0.98 < . e − L g ) vs. log( D peak) 0.94 < . e − Notes.

Rank correlation coe ﬃ cient r and the p-value p . sity level, Luparello et al. (2011) determined superclusters in theSDSS-DR7 for volume-limited samples of galaxies. We usedﬂux-limited samples, thus the density levels cannot be compareddirectly, but we can still choose this level for the present test.Table 8 shows the results of the Spearman’s correlation test forthis density level. The comparison with Tables 4 and 3 showsthat the coe ﬃ cients are almost the same. Therefore the resultsof the correlation test and the PCA are not very sensitive to thechoise of the density level.

6. Discussion and conclusions

We studied the properties of superclusters drawn from the SDSSDR7 using the principal component analysis and Spearman’scorrelation test. Several earlier studies have shown that theproperties of superclusters are correlated (see the references inSect. 1). However, it is surprising that the correlations betweenthe various properties of superclusters are so tight. The ﬁrst twoprincipal components account for most of the variance in thedata. Di ﬀ erent physical parameters (the luminosity, volume, anddiameter) and the morphological parameters (the clumpiness andthe shape parameters) are almost equally important in shapingthe properties of superclusters. This suggests that superclusters,as described by their overall physical and morphological prop-erties and by their inner morphology and peak density, are ob-jects that can be described with a few parameters. We derivedthe scaling relation for superclusters in which we combine theirluminosities, diameters, and shapeﬁnders.We saw in Fig. 7 that more elongated and less elon-gated high-luminosity superclusters populate the L g ( observed ) - L g ( predicted ) plane di ﬀ erently. This suggests that luminous su-perclusters can be divided into two populations according totheir shapes – more elongated systems with the shape param-eter K / K < . K / K > . ﬀ erentmultivariate methods reveal information about the data in suchgood agreement. However, there are few high-luminosity super-clusters in our sample. There are 14 systems with the shapeparameter K / K < . K / K > .

5. A larger sample of superclusters has to be anal-ysed to conﬁrm this result.Parameters used to characterise superclusters in the presentstudy do not reﬂect all the properties of superclusters.For example, rich superclusters contain high-density coresthat may contain merging X-ray clusters and may be col-lapsing (Small et al. 1998; Bardelli et al. 2000; Einasto et al.2001; Rose et al. 2002; Einasto et al. 2007c, 2008). A su-percluster environment with a wide range of densities af- fects the properties of galaxies, groups, and clusters lo-cated there (Einasto et al. 2003b; Plionis 2004; Wolf et al.2005; Haines et al. 2006; Einasto et al. 2007d; Porter et al.2008; Tempel et al. 2009; Fleenor & Johnston-Hollitt 2010;Tempel et al. 2011; Einasto et al. 2011b). Einasto et al. (2011b)showed that the dynamical evolution of one of the richest su-perclusters in the Sloan Great Wall (SCL 111, SCl 024 in L10catalogue) is almost ﬁnished, while the richest member of theWall, SCl 126 (SCl 061) is still dynamically active. Thereforeour results reﬂect only certain aspects of the properties of super-clusters.Systems of galaxies determined in the SDSS have beenstudied by a number of authors (Pandey & Bharadwaj 2005;Gott et al. 2005; Park et al. 2005; Pandey & Bharadwaj 2006;Gott et al. 2008; Pandey & Bharadwaj 2008; Kitaura et al. 2009;Choi et al. 2010; Sousbie et al. 2011; Einasto et al. 2011b,a;Sheth & Diaferio 2011; Pimbblet et al. 2011; Platen et al.2011). The overall shapes of superclusters have been de-scribed by the shape parameters or approximated by tri-axial ellipses (Jaaniste et al. 1998; Basilakos et al. 2001;Kolokotronis et al. 2002; Basilakos 2003; Einasto et al. 2007a,2011b,a; Costa-Duarte et al. 2011; Luparello et al. 2011). Thesestudies showed that elongated, prolate structures dominateamong superclusters. The results obtained using the momentsof inertia tensor (Basilakos et al. 2001; Basilakos 2003) orthe Minkowski functionals are in a good agreement (see alsoEinasto et al. 2007e, 2011a). In addition, Basilakos et al. (2006)analysed correlations between supercluster properties from sim-ulations and ﬁnd that the amplitude of the supercluster - clusteralignment increases (weakly) with superclusters ﬁlamentarity.The properties of superclusters are determined by their for-mation and evolution. Kolokotronis et al. (2002) show that theshapes of superclusters agree better with a Λ CDM model thanwith a τ CDM model. Also Luparello et al. (2011) found thatthe shapes of observed superclusters agree with those in the Λ CDM model. In the Λ CDM concordance cosmological model,the matter density Ω m dominated in the early universe andthe structures formed by hierarhical clustering driven by grav-ity. As the universe expands, the average matter density de-creases. At a certain epoch, the dark energy density Ω Λ becamehigher than the matter density, and the universe started to ex-pand acceleratingly. Simulations of the evolution and the fu-ture of the structure in an accelerating universe show the freez-ing of the web – the large-scale evolution of structures slowsdown (Loeb 2002; Nagamine & Loeb 2003; D¨unner et al. 2006;Ho ﬀ man et al. 2007; Krauss & Scherrer 2007, and referencestherein). Araya-Melo et al. (2009a) show that this a ﬀ ects thesizes, the shapes, and the inner structure of superclusters, andthey become rounder, smaller, and their multiplicity decreases.According to our present results, this suggests that in the futuresuperclusters become less elongated and the scatter in the scal-ing relation of superclusters may decrease.Summarising, our study showed that1) The PCA and Spearman’s correlation test showed the ab-sence of correlations between the physical properties ofsuperclusters and their distance, therefore the distance-dependent selection e ﬀ ects were taken into account properlywhen generating supercluster catalogues.2) The correlations between the properties of superclusters aretight. Di ﬀ erent physical parameters (the luminosity, the vol-ume, and the diameter) and the morphological parameters(the clumpiness and the shapeﬁnders) of superclusters areequally important in shaping the properties of superclusters.

9. Einasto et al.: PCA

3) The ﬁrst two principal components account for more than90% of the variance of the supercluster properties and deﬁnethe fundamental plane of superclusters. This suggests thatsuperclusters can be described with a few physical and mor-phological parameters. We derived the scaling relation forsuperclusters using data about their luminosities, diameters,and shapeﬁnders.4) Superclusters can be divided into two populations accord-ing to their luminosity, using the luminosity limit L g = h − L ⊙ . In agreement with Einasto et al. (2011a), weﬁnd that high-luminosity superclusters can be divided intotwo sets: more elongated systems with the shape parameter K / K < . K / K > . ﬀ ected by selection e ﬀ ects. To understand the properties of su-perclusters better the next step is to study a large sample of su-perclusters and high-redshift superclusters. A few superclustersat very high redshifts have already been discovered (Nakata et al.2005; Swinbank et al. 2007; Gal et al. 2008; Tanaka et al. 2009;Planck Collaboration et al. 2011; Schirmer et al. 2011). Deepsurveys like the ALHAMBRA project (Moles et al. 2008) willprovide us with data about (possible) very distant superclusters.We also need more simulations with various cosmologies to un-derstand the evolution and the properties of superclusters in de-tail. Acknowledgements.

We thank the referee, Dr. S. Basilakos, for the com-ments and suggestions that helped to improve the paper. Funding for theSloan Digital Sky Survey (SDSS) and SDSS-II has been the National ScienceFoundation, the U.S. Department of Energy, the National Aeronautics and SpaceAdministration, the Japanese Monbukagakusho, and the Max Planck Society,and the Higher Education Funding Council for England. The SDSS Web site ishttp: // / .The SDSS is managed by the Astrophysical Research Consortium (ARC)for the Participating Institutions. The Participating Institutions are the AmericanMuseum of Natural History, Astrophysical Institute Potsdam, Universityof Basel, University of Cambridge, Case Western Reserve University, TheUniversity of Chicago, Drexel University, Fermilab, the Institute for AdvancedStudy, the Japan Participation Group, The Johns Hopkins University, the JointInstitute for Nuclear Astrophysics, the Kavli Institute for Particle Astrophysicsand Cosmology, the Korean Scientist Group, the Chinese Academy of Sciences(LAMOST), Los Alamos National Laboratory, the Max-Planck-Institute forAstronomy (MPIA), the Max-Planck-Institute for Astrophysics (MPA), NewMexico State University, Ohio State University, University of Pittsburgh,University of Portsmouth, Princeton University, the United States NavalObservatory, and the University of Washington.We acknowledge the Estonian Science Foundation for support under grantsNo. 8005 and 7146, 7765, and the Estonian Ministry for Education andScience support by grant SF0060067s08. This work has also been supported byICRAnet through a professorship for Jaan Einasto, by the University of Valenciathrough a visiting professorship for Enn Saar and by the Spanish MEC projectAYA2006-14056, “PAU” (CSD2007-00060), including FEDER contributions,and the Generalitat Valenciana project of excellence PROMETEO / / R ,an open-source free statistical environment developed under the GNU GPL(Ihaka & Gentleman 1996, ). References

Abazajian, K. N., Adelman-McCarthy, J. K., Ag¨ueros, M. A., et al. 2009, ApJS,182, 543Adami, C., Mazure, A., Biviano, A., Katgert, P., & Rhee, G. 1998, A&A, 331,493Adelman-McCarthy, J. K., Ag¨ueros, M. A., Allam, S. S., et al. 2008, ApJS, 175,297Araya-Melo, P. A., Reisenegger, A., Meza, A., et al. 2009a, MNRAS, 399, 97Araya-Melo, P. A., van de Weygaert, R., & Jones, B. J. T. 2009b, MNRAS, 400,1317Bardelli, S., Zucca, E., Zamorani, G., Moscardini, L., & Scaramella, R. 2000,MNRAS, 312, 540 Basilakos, S. 2003, MNRAS, 344, 602Basilakos, S., Plionis, M., & Rowan-Robinson, M. 2001, MNRAS, 323, 47Basilakos, S., Plionis, M., Yepes, G., Gottl¨ober, S., & Turchaninov, V. 2006,MNRAS, 365, 539Blanton, M. R. & Roweis, S. 2007, AJ, 133, 734Bond, N. A., Strauss, M. A., & Cen, R. 2010, MNRAS, 409, 156Chang, Y.-Y., Chao, R., Wang, W.-H., & Chen, P. 2010, ArXiv: 1009.0030Choi, Y.-Y., Park, C., Kim, J., et al. 2010, ApJS, 190, 181Coppa, G., Mignoli, M., Zamorani, G., et al. 2010, ArXiv: 1009.0723Costa-Duarte, M. V., Sodr´e, Jr., L., & Durret, F. 2011, MNRAS, 411, 1716de Lapparent, V., Geller, M. J., & Huchra, J. P. 1986, ApJL, 302, L1Deeming, T. J. 1964, MNRAS, 127, 493Djorgovski, S. & Davis, M. 1987, ApJ, 313, 59D’Onofrio, M., Fasano, G., Varela, J., et al. 2008, ApJ, 685, 875Dressler, A., Lynden-Bell, D., Burstein, D., et al. 1987, ApJ, 313, 42D¨unner, R., Araya, P. A., Meza, A., & Reisenegger, A. 2006, MNRAS, 366, 803Efstathiou, G. & Fall, S. M. 1984, MNRAS, 206, 453Einasto, J. 2010, in American Institute of Physics Conference Series, Vol.1205, American Institute of Physics Conference Series, ed. R. Ru ﬃ ni &G. Vereshchagin, 72–81Einasto, J., Einasto, M., Saar, E., et al. 2006, A&A, 459, L1Einasto, J., Einasto, M., Saar, E., et al. 2007a, A&A, 462, 397Einasto, J., Einasto, M., Tago, E., et al. 2007b, A&A, 462, 811Einasto, J., H¨utsi, G., Einasto, M., et al. 2003a, A&A, 405, 425Einasto, M., Einasto, J., Tago, E., Dalton, G. B., & Andernach, H. 1994,MNRAS, 269, 301Einasto, M., Einasto, J., Tago, E., M¨uller, V., & Andernach, H. 2001, AJ, 122,2222Einasto, M., Einasto, J., Tago, E., et al. 2007c, A&A, 464, 815Einasto, M., Einasto, J., Tago, E., et al. 2007d, A&A, 464, 815Einasto, M., Jaaniste, J., Einasto, J., et al. 2003b, A&A, 405, 821Einasto, M., Liivam¨agi, L. J., Tago, E., et al. 2011a, A&A, 532, A5Einasto, M., Liivam¨agi, L. J., Tempel, E., et al. 2011b, ApJ, 736, 51Einasto, M., Saar, E., Liivam¨agi, L. J., et al. 2007e, A&A, 476, 697Einasto, M., Saar, E., Mart´ınez, V. J., et al. 2008, ApJ, 685, 83Einasto, M., Tago, E., Jaaniste, J., Einasto, J., & Andernach, H. 1997, A&AS,123, 119Erdo˘gdu, P., Lahav, O., Zaroubi, S., et al. 2004, MNRAS, 352, 939Faber, S. M. & Jackson, R. E. 1976, ApJ, 204, 668Ferreras, I., Pasquali, A., de Carvalho, R. R., de la Rosa, I. G., & Lahav, O. 2006,MNRAS, 370, 828Fleenor, M. C. & Johnston-Hollitt, M. 2010, in Astronomical Society of thePaciﬁc Conference Series, Vol. 423, Astronomical Society of the PaciﬁcConference Series, ed. B. Smith, J. Higdon, S. Higdon, & N. Bastian, 81Gal, R. R., Lemaux, B. C., Lubin, L. M., Kocevski, D., & Squires, G. K. 2008,ApJ, 684, 933Gott, J. R. I., Hambrick, D. C., Vogeley, M. S., et al. 2008, ApJ, 675, 16Gott, J. R. I., Juri´c, M., Schlegel, D., et al. 2005, ApJ, 624, 463Gregory, S. A. & Thompson, L. A. 1978, ApJ, 222, 784Haines, C. P., Merluzzi, P., Mercurio, A., et al. 2006, MNRAS, 371, 55Hatch, N. A., De Breuck, C., Galametz, A., et al. 2011, MNRAS, 410, 1537Ho ﬀ man, Y., Lahav, O., Yepes, G., & Dover, Y. 2007, J. Cosmology Astropart.Phys., 10, 16Ihaka, R. & Gentleman, R. 1996, Journal of Computational and GraphicalStatistics, 5, 299Ishida, E. E. O. & de Souza, R. S. 2011, A&A, 527, A49Ishida, E. E. O., de Souza, R. S., & Ferrara, A. 2011, ArXiv: 1106.1745Jaaniste, J., Tago, E., Einasto, M., et al. 1998, A&A, 336, 35Jeeson-Daniel, A., Dalla Vecchia, C., Haas, M. R., & Schaye, J. 2011, MNRAS,415, L69Joeveer, M., Einasto, J., & Tago, E. 1978, MNRAS, 185, 357Kalinkov, M. & Kuneva, I. 1995, A&AS, 113, 451Kitaura, F. S., Jasche, J., Li, C., et al. 2009, MNRAS, 400, 183Kolokotronis, V., Basilakos, S., & Plionis, M. 2002, MNRAS, 331, 1020Kolokotronis, V., Basilakos, S., Plionis, M., & Georgantopoulos, I. 2001,MNRAS, 320, 49Kormendy, J. 1977, ApJ, 218, 333Krauss, L. M. & Scherrer, R. J. 2007, General Relativity and Gravitation, 39,1545Lanzoni, B., Ciotti, L., Cappi, A., Tormen, G., & Zamorani, G. 2004, ApJ, 600,640Liivam¨agi, L. J., Tempel, E., & Saar, E. 2010, ArXiv: 1012.1989Loeb, A. 2002, Phys. Rev. D, 65, 047301Luparello, H., Lares, M., Lambas, D. G., & Padilla, N. 2011, MNRAS, 415, 964Mart´ınez, V. J., Arnalte-Mur, P., Saar, E., et al. 2009, ApJL, 696, L93Mart´ınez, V. J. & Saar, E. 2002, Statistics of the Galaxy Distribution (Chapman& Hall / CRC, Boca Raton)Mobasher, B., Dickinson, M., Ferguson, H. C., et al. 2005, ApJ, 635, 832

10. Einasto et al.: PCA

Moles, M., Ben´ıtez, N., Aguerri, J. A. L., et al. 2008, AJ, 136, 1325Nagamine, K. & Loeb, A. 2003, New A, 8, 439Nakata, F., Kodama, T., Shimasaku, K., et al. 2005, MNRAS, 357, 1357Ouchi, M., Shimasaku, K., Akiyama, M., et al. 2005, ApJL, 620, L1Pandey, B. & Bharadwaj, S. 2005, MNRAS, 357, 1068Pandey, B. & Bharadwaj, S. 2006, MNRAS, 372, 827Pandey, B. & Bharadwaj, S. 2008, MNRAS, 387, 767Park, C., Choi, Y., Vogeley, M. S., Gott, III, J. R., & Blanton, M. R. 2007, ApJ,658, 898Park, C., Choi, Y., Vogeley, M. S., et al. 2005, ApJ, 633, 11Pimbblet, K. A., Andernach, H., Fishlock, C. K., Roseboom, I. G., & Owers,M. S. 2011, MNRAS, 410, 1837Planck Collaboration, Ade, P. A. R., Aghanim, N., et al. 2011, ArXiv: 1101.2024Platen, E., van de Weygaert, R., Jones, B. J. T., Vegter, G., & Arag´on-Calvo,M. A. 2011, MNRAS, 1062Plionis, M. 2004, in IAU Colloq. 195: Outskirts of Galaxy Clusters: Intense Lifein the Suburbs, ed. A. Diaferio, 19–25Porter, S. C., Raychaudhury, S., Pimbblet, K. A., & Drinkwater, M. J. 2008,MNRAS, 388, 1152Rose, J. A., Gaba, A. E., Christiansen, W. A., et al. 2002, AJ, 123, 1216Saar, E. 2009, in Data Analysis in Cosmology, ed. V. J. Mart´ınez & E. Saar &E. Mart´ınez-Gonzalez & M.-J. Pons-Border´ıa (Springer-Verlag, Berlin), 523–563Saar, E., Mart´ınez, V. J., Starck, J., & Donoho, D. L. 2007, MNRAS, 374, 1030Sahni, V., Sathyaprakash, B. S., & Shandarin, S. F. 1998, ApJL, 495, L5S´anchez Almeida, J., Aguerri, J. A. L., Mu˜noz-Tu˜n´on, C., & de Vicente, A. 2010,ApJ, 714, 487Schae ﬀ er, R., Maurogordato, S., Cappi, A., & Bernardeau, F. 1993, MNRAS,263, L21Schirmer, M., Hildebrandt, H., Kuijken, K., & Erben, T. 2011, A&A, 532, A57Shandarin, S. F., Sheth, J. V., & Sahni, V. 2004, MNRAS, 353, 162Sheth, R. K. & Diaferio, A. 2011, ArXiv: 1105.3378Silverman, B. W. 1986, Density Estimation for Statistics and Data Analysis(Chapman & Hall, CRC Press, Boca Raton)Skibba, R. A. & Maccio’, A. V. 2011, ArXiv: 1103.1641Small, T. A., Ma, C., Sargent, W. L. W., & Hamilton, D. 1998, ApJ, 492, 45Sousbie, T., Pichon, C., & Kawahara, H. 2011, MNRAS, 414, 384Swinbank, A. M., Edge, A. C., Smail, I., et al. 2007, MNRAS, 379, 1343Tago, E., Saar, E., Tempel, E., et al. 2010, A&A, 514, A102Tanaka, M., Finoguenov, A., Kodama, T., et al. 2009, A&A, 505, L9Tempel, E., Einasto, J., Einasto, M., Saar, E., & Tago, E. 2009, A&A, 495, 37Tempel, E., Saar, E., Liivam¨agi, L. J., et al. 2011, A&A, 529, A53Tiit, E. & Einasto, J. 1964, Publications of the Tartu Astroﬁzica Observatory, 34,156Toribio, M. C., Solanes, J. M., Giovanelli, R., Haynes, M. P., & Martin, A. M.2011, ApJ, 732, 93Tully, R. B. & Fisher, J. R. 1977, A&A, 54, 661Venemans, B. P., R¨ottgering, H. J. A., Overzier, R. A., et al. 2004, A&A, 424,L17Wolf, C., Gray, M. E., & Meisenheimer, K. 2005, A&A, 443, 435Woo, J., Courteau, S., & Dekel, A. 2008, MNRAS, 390, 1453Wray, J. J., Bahcall, N. A., Bode, P., Boettiger, C., & Hopkins, P. F. 2006, ApJ,652, 907Zeldovich, I. B., Einasto, J., & Shandarin, S. F. 1982, Nature, 300, 407Zucca, E., Zamorani, G., Scaramella, R., & Vettolani, G. 1993, ApJ, 407, 470 Appendix A: Luminosity density ﬁeld andsuperclusters

To calculate the luminosity density ﬁeld, we calculate the lumi-nosities of groups ﬁrst. In ﬂux-limited samples, galaxies outsidethe observational window remain unobserved. To take into ac-count the luminosities of the galaxies that lie outside the samplelimits also we multiply the observed galaxy luminosities by theweight W d . The distance-dependent weight factor W d was calcu-lated as W d = R ∞ L n ( L )d L R L L L n ( L )d L , (A.1)where L , = L ⊙ . M ⊙ − M , ) are the luminosity limits of theobservational window at a distance d , corresponding to the ab-solute magnitude limits of the window M and M ; we took M ⊙ = .

64 mag in the r -band (Blanton & Roweis 2007). Due totheir peculiar velocities, the distances of galaxies are somewhatuncertain; if the galaxy belongs to a group, we use the groupdistance to determine the weight factor. W e i gh t s Distance ( h -1 Mpc) 1 1.2 1.4 1.6 1.8 2 100 150 200 250 300

Fig. A.1.

Weights used to correct for probable group membersoutside the observational luminosity window.The luminosity weights for the groups of the SDSS DR7 inthe distance interval 90 h − Mpc ≥ D ≤ h − Mpc are plottedas a function of the distance from the observer in Fig. A.1. Themean weight is slightly higher than unity (about 1.4) within thesample limits. When the distance is greater, the weights increaseowing to the absence of faint galaxies. Details of the calculationsof weights are given also in Tempel et al. (2011). In the ﬁnalﬂux-limited group catalogue, the richness of groups decreasesrapidly at distances D > h − Mpc due to selection e ﬀ ects(Tago et al. 2010; Einasto et al. 2011a). This is another reasonto choose for our study superclusters from the distance interval90 h − Mpc ≤ D ≤ h − Mpc where the selection e ﬀ ects areweak. Even the poorest systems in our sample contain severalgroups of galaxies being real galaxy systems comparable to theLocal supercluster.To calculate a luminosity density ﬁeld, we convert the spa-tial positions of galaxies r i and their luminosities L i into spatial(luminosity) densities using kernel densities (Silverman 1986): ρ ( r ) = X i K ( r − r i ; a ) L i , (A.2)where the sum is over all galaxies, and K ( r ; a ) is a kernel func-tion of a width a . Good kernels for calculating densities on aspatial grid are generated by box splines B J . Box splines are lo-cal and they are interpolating on a grid: X i B J ( x − i ) = , (A.3)for any x and a small number of indices that give non-zero valuesfor B J ( x ). We use the popular B spline function: B ( x ) = (cid:16) | x − | − | x − | + | x | −− | x + | + | x + | (cid:17) / . (A.4)The (one-dimensional) B box spline kernel K (1) B of the width a is deﬁned as K (1) B ( x ; a , δ ) = B ( x / a )( δ/ a ) , (A.5)where δ is the grid step. This kernel di ﬀ ers from zero only in theinterval x ∈ [ − a , a ]. It is close to a Gaussian with σ = . x ∈ [ − a , a ], so its e ﬀ ective width is 2 a (see, e.g., Saar2009). The kernel preserves the interpolation property exactly

11. Einasto et al.: PCA for all values of a and δ , where the ratio a /δ is an integer. (Thiskernel can be used also if this ratio is not an integer, and a ≫ δ ;the kernel sums to 1 in this case, too, with a very small error.)This means that if we apply this kernel to N points on a one-dimensional grid, the sum of the densities over the grid is exactly N . The three-dimensional kernel K (3) B is given by the directproduct of three one-dimensional kernels: K (3) B ( r ; a , δ ) ≡ K (1)3 ( x ; a , δ ) K (1)3 ( y ; a , δ ) K (1)3 ( z ; a , δ ) , (A.6)where r ≡ { x , y , z } . Although this is a direct product, it isisotropic to a good degree (Saar 2009).In Einasto et al. (2007e) we compared the Epanechnikov, theGaussian, and B box spline kernels for calculating the densityﬁeld. The Epanechnikov and the B kernels are both compact,while the Gaussian kernel is inﬁnite and has to be cut o ﬀ at aﬁxed radius, which introduces an extra parameter. We also foundthat both the Epanechnikov and the B kernels describe the over-all shape of superclusters well, while the B box spline kernelresolves the inner structure of superclusters better. This is whywe used this kernel in the present study.The densities were calculated on a cartesian grid based onthe SDSS η , λ coordinate system, as it allowed the most e ﬃ cientﬁt of the galaxy sample cone into a brick. Using the rms veloc-ity σ v , translated into distance, and the rms projected radius σ r from the group catalogue (T10), we suppress the cluster ﬁngerredshift distortions. We divide the radial distances between thegroup galaxies and the group centre by the ratio of the rms sizesof the group ﬁnger: d gal , f = d group + ( d gal , i − d group ) σ r /σ v . (A.7)This removes the smudging e ﬀ ect the ﬁngers have on the densityﬁeld.The grid coordinates are calculated according to Eq.3. Weused an 1 h − Mpc step grid and chose the kernel width a = h − Mpc. This kernel di ﬀ ers from zero within the radius16 h − Mpc, but signiﬁcantly so only inside the 8 h − Mpc ra-dius. As a lower limit for the volume of superclusters we usedthe value ( a / h − Mpc (64 grid cells). In this way we ex-clude small spurious density ﬁeld objects which include almostno galaxies. Liivam¨agi et al. (2010) tested the method generatingthe superclusters from the Millenium simulations. This compari-son showed that supercluster algorithms work well, and, in addi-tion, the selection e ﬀ ects have been properly taken into accountwhen generating a supercluster catalogue from ﬂux-limited sam-ple of galaxies.Before extracting superclusters we apply the DR7mask constructed by P. Arnalte-Mur (Mart´ınez et al. 2009;Liivam¨agi et al. 2010) to the density ﬁeld and convert densitiesinto units of mean density. The mean density is deﬁned asthe average over all pixel values inside the mask. The maskis designed to follow the edges of the survey and the galaxydistribution inside the mask is assumed to be homogeneous. Appendix B: Minkowski functionals andshapeﬁnders

The supercluster morphology is fully characterised by the fourMinkowski functionals V – V . For a given surface the fourMinkowski functionals (from the ﬁrst to the fourth) are propor-tional to the enclosed volume V , the area of the surface S , the integrated mean curvature C , and the integrated Gaussian curva-ture χ (Sahni et al. 1998; Mart´ınez & Saar 2002; Shandarin et al.2004; Saar et al. 2007; Saar 2009).With the ﬁrst three Minkowski functionals, we calculatethe dimensionless shapeﬁnders K (planarity) and K (ﬁla-mentarity) (Sahni et al. 1998; Shandarin et al. 2004). See alsoBasilakos et al. (2001), in this study the shapeﬁnders were deter-mined with the moments of inertia method. First we calculate theshapeﬁnders H – H with a combination of Minkowski function-als: H = V / S (thickness), H = S / C (width), and H = C / π (length). Then we use the shapeﬁnders H – H to calculate twodimensionless shapeﬁnders K (planarity) and K (ﬁlamentar-ity): K = ( H − H ) / ( H + H ) and K = ( H − H ) / ( H + H ). Wecharacterise the overall shape of superclusters using planarity K and ﬁlamentarity K , and their ratio, K / K (the shape parame-ter).The fourth Minkowski functional V , describes the topol-ogy of the surface and gives the number of isolated clumps,the number of void bubbles, and the number of tunnels (voidsopen from both sides) in the region (see, e.g. Saar et al. 2007).Morphologically the superclusters with low values of the fourthMinkowski functional V can be described as simple spiders orsimple ﬁlaments. High values of the fourth Minkowski func-tional V suggest a complicated (clumpy) morphology of a su-percluster, described as multispiders or multibranching ﬁlaments(Einasto et al. 2007e, 2011a). Appendix C: Data on luminous ( L g >

400 10 h − L ⊙ ) superclusters

12. Einasto et al.: PCA

Table C.1.

Data on luminous ( L g >

400 10 h − L ⊙ ) superclusters (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) (13)ID ID Distance L g N gal Volume Diameter D peak V K K K / K ID E Mpc / h h − L ⊙ ( h − Mpc) Mpc / h + +

009 264 1591.5 1038 8435 50 21.6 2 0.080 0.152 0.527 16210 239 + +

003 111 680.2 1463 3378 22 16.4 1 0.038 0.015 2.456 16011 227 + +

007 233 1476.0 1222 8065 35 16.7 4 0.053 0.049 1.081 15424 184 + +

007 230 1768.2 1469 10040 56 14.1 5 0.089 0.145 0.616 11138 167 + +

007 224 660.7 586 3243 22 13.8 2 0.023 0.040 0.593 9555 173 + +

008 242 1773.0 1306 9684 50 12.3 5 0.091 0.179 0.509 11160 247 + +

002 92 527.4 1335 2472 21 12.0 2 0.013 0.021 0.645 16061 202-001 +

008 255 4315.3 3056 23475 106 12.9 13 0.126 0.459 0.274 12664 250 + +

010 301 1305.4 619 6058 55 12.6 4 0.091 0.229 0.399 16487 215 + +

007 213 477.8 445 2301 21 11.0 2 0.039 0.026 1.49494 230 + +

006 215 2263.4 1830 11256 54 11.1 8 0.113 0.399 0.284 158129 170 + +

010 309 526.7 223 2321 20 10.6 3 0.029 0.048 0.612136 189 + +

007 212 523.2 504 2590 20 10.9 2 0.027 0.030 0.925 271152 230 + +

010 301 907.5 423 4756 32 10.8 3 0.057 0.097 0.585 160189 126 + +

009 267 771.0 433 3063 43 9.5 4 0.070 0.190 0.372195 134 + +

009 280 487.9 273 2200 23 9.9 2 0.031 0.031 1.004198 152-000 +

009 284 863.9 473 4448 38 9.7 4 0.050 0.103 0.490 82223 187 + +

008 268 703.7 462 3368 33 9.3 3 0.051 0.142 0.361 111228 203 + +

007 210 644.0 643 3361 31 9.5 2 0.040 0.040 0.992 133327 170 + +

010 302 419.8 205 1747 20 8.5 2 0.016 0.071 0.228332 175 + +

009 291 664.3 333 3128 27 8.2 3 0.062 0.078 0.788 106336 172 + +

007 207 1003.6 1005 4605 53 8.7 5 0.082 0.246 0.332 109349 207 + +

006 188 768.8 893 3942 42 8.8 4 0.064 0.105 0.610 138350 230 + +

003 105 436.3 955 1987 22 8.0 2 0.022 0.059 0.383 160351 207 + +

007 225 689.1 615 3292 32 8.7 4 0.056 0.086 0.647 138366 217 + +

010 300 763.4 353 3681 31 8.1 4 0.064 0.156 0.409 158376 255 + +

008 258 658.0 437 3097 27 8.6 4 0.050 0.041 1.228 167474 133 + +

008 251 612.6 389 2299 43 7.6 4 0.068 0.223 0.307 76512 168 + +

007 227 410.7 371 1658 26 7.5 3 0.040 0.082 0.490 91530 192 + +

010 306 790.3 333 3690 40 7.5 4 0.084 0.207 0.409827 189 + +

008 254 572.4 405 2238 30 6.7 4 0.052 0.116 0.450

Notes.

Columns are as follows: 1: ID in L10 catalogue; 2: supercluster ID (AAA + BBB + ZZZ, AAA – R.A., +/ -BBB – Dec., CCC – 100 z ); 3: thedistance of the supercluster; 4: the total weighted luminosity of galaxies in the supercluster, L g; 5: the number of galaxies in a supercluster, N gal;6: the volume of the supercluster, Volume ; 7: the supercluster diameter,

Diameter (the maximum distance between galaxies in the supercluster); 8:the peak density D peak of the supercluster, in units of mean density; 9: the maximum value of the fourth Minkowski functional, V (clumpiness),for the supercluster; 10 – 12: shapeﬁnders K (planarity) and K (ﬁlamentarity), and the ratio of the shapeﬁnders K / K of the full supercluster.13: ID E01