[PDF] A new model for virtual machine migration in virtualized cluster server based on Fuzzy Decision Making

Abstract

In this paper, we show that performance of the virtualized cluster servers could be improved through intelligent decision over migration time of Virtual Machines across heterogeneous physical nodes of a cluster server. The cluster serves a variety range of services from Web Service to File Service. Some of them are CPU-Intensive while others are RAM-Intensive and so on. Virtualization has many advantages such as less hardware cost, cooling cost, more manageability. One of the key benefits is better load balancing by using of VM migration between hosts. To migrate, we must know which virtual machine needs to be migrated and when this relocation has to be done and, moreover, which host must be destined. To relocate VMs from overloaded servers to underloaded ones, we need to sort nodes from the highest volume to the lowest. There are some models to finding the most overloaded node, but they have some shortcomings. The focus of this paper is to present a new method to migrate VMs between cluster nodes using TOPSIS algorithm - one of the most efficient Multi Criteria Decision Making techniques- to make more effective decision over whole active servers of the Cluster and find the most loaded serversTo evaluate the performance improvement resulted from this model, we used cluster Response time and Unbalanced Factor.

Full PDF

JJOURNAL OF TELECOMMUNICATIONS, VOLUME 1, ISSUE 1, FEBRUARY 2010 40

A new model for virtual machine migration in virtualized cluster server based on Fuzzy Decision Making

M.Tarighi, S.A.Motamedi, S.Sharifian

Abstract —In this paper, we show that performance of the virtualized cluster servers could be improved through intelligent decision over migration time of Virtual Machines across heterogeneous physical nodes of a cluster server. The cluster serves a variety range of services from Web Service to File Service. Some of them are CPU-Intensive while others are RAM-Intensive and so on. Virtualization has many advantages such as less hardware cost, cooling cost, more manageability. One of the key benefits is better load balancing by using of VM migration between hosts. To migrate, we must know which virtual machine needs to be migrated and when this relocation has to be done and, moreover, which host must be destined. To relocate VMs from overloaded servers to underloaded ones, we need to sort nodes from the highest volume to the lowest. There are some models to finding the most overloaded node, but they have some shortcomings. The focus of this paper is to present a new method to migrate VMs between cluster nodes using TOPSIS algorithm - one of the most efficient Multi Criteria Decision Making techniques- to make more effective decision over whole active servers of the Cluster and find the most loaded serversTo evaluate the performance improvement resulted from this model, we used cluster Response time and Unbalanced Factor.

Index Terms —Cluster server, Migration, MCDM, TOPSIS —————————— (cid:1) —————————— I NTRODUCTION perating system virtualization has attracted consider-able interest in recent years; particularly from the data center and cluster computing communities [6]. Data cen-ters have become popular in a variety of domains such as web hosting, enterprise systems, and e-commerce sites [50]. Server resources in a data center are multiplexed across multiple applications--each server runs one or more applications. Further, each application sees dynamic workload fluctuations caused by incremental growth, time-of-day effects, and flash crowds [1]. Process migration, a hot topic in systems research during the 1980s [10, 11, 12, 13, 14], has seen very little use for real-world applications. Milojicic et al [65] give a tho-rough survey of possible reasons for this, including the problem of the residual dependencies that a migrated process retains on the machine from which it migrated. These are undesirable because the original machine must remain available, and because they usually negatively impact the performance of migrated processes. Another Applicable approach for reducing management complexity is to employ virtualization. In this approach, applications run on virtual servers that are constructed using virtual machines, and one or more virtual servers are mapped onto each physical server in the system. Vir-tualization of data center resources provides numerous benefits. It enables application isolation since malicious or greedy applications can not impact other applications co-located on the same physical server. It enables server con-solidation and provides better multiplexing of data center resources across applications. Perhaps the biggest advan-tage of employing virtualization is the ability to flexibly remap physical resources to virtual servers in order to handle workload dynamics. Migration is transparent to the applications and all modern virtual machines support this capability [6,20]. A workload increase can be handled by increasing the resources allocated to a virtual server, if idle resources are available on the physical server, or by migrating the vir-tual server to a less loaded physical server. The Xen migration work [6] showed that virtual machine migration can enable robust and highly responsive provi-sioning in data centers. What is missing is a convincing vali-dation and algorithms to effect migration, The idea of process migration was first investigated in the 80's [27]. Support for migrating groups of processes across ———————————————— • M.Tarighi is with the Electrical & Electronics Engineering Department, University of AMIR KABIR, High Performance Computing Lab., Tehran, Iran. • S.A.Motamedi is with the Electrical & Electronics Engineering Depart-ment, University of AMIR KABIR, High Performance Computing Lab., Tehran, Iran. • S.Sharifian is with the Electrical & Electronics Engineering Department, University of AMIR KABIR, High Performance Computing Lab., Tehran, Iran. • o OSes was presented in [16], but applications had to be sus-pended and it did not address the problem of maintaining open network connections. Virtualization support for com-modity operating systems in [7] led towards techniques for virtual machine migration over long time spans, suitable for WAN migration [24]. More recently, Xen [6] and VM-Ware [20] have implemented ``live'' migration of VMs that involve extremely short downtimes ranging from tens of milliseconds to a second. VM migration has been used for dynamic resource allocation in Grid environments [23,26 ,8]. A system employing automated VM migrations for scientific nano-technology workloads on federated grid environments was investigated in [23]. The Shirako system provides infra-structure for leasing resources within a federated cluster environment and was extended to use virtual machines for more flexible resource allocation in [8]. Shirako uses migra-tions to enable dynamic placement decisions in response to resource broker and cluster provider policies. In contrast, we focus on data center environments with stringent SLA re-quirements that necessitate highly responsive migration algorithms for online load balancing. VMware's Distributed Resource Scheduler [29] uses migration to perform auto-mated load balancing in response to CPU and memory pres-sure. DRS uses a userspace application to monitor memory usage similar to Sandpiper's gray box monitor, but unlike Sandpiper, it cannot utilize application logs to respond di-rectly to potential SLA violations or to improve placement decisions. Dedicated hosting is a category of dynamic provisioning in which each physical machine runs at most one application and workload increases are handled by spawning a new replica of the application on idle servers. Physical server granularity provisioning has been investigated in [1,22]. Techniques for modeling and provisioning multi-tier Web services by allocating physical machines to each tier are pre-sented in [28]. Although dedicated hosting provides com-plete isolation, the cost is reduced responsiveness - without virtualization, moving from one physical machine to another takes on the order of several minutes [28] making it unsuita-ble for handling flash crowds. Our current implementation does not replicate virtual machines, implicitly assuming that PMs are sufficiently provisioned. Shared hosting is the second variety of dynamic provision-ing, and allows a single physical machine to be shared across multiple services. Various economic and resource models to allocate shared resources have been presented in [5]. Me-chanisms to partition and share resources across services include [2,5]. A dynamic provisioning algorithm to allocate CPU shares to VMs on a single physical machine (as op-posed to a cluster) was presented and evaluated through simulations in [19]. In comparison to the above systems, our work assumes a shared hosting platform and uses VMs to partition CPU, memory, and network resources, but addi-tionally leverages VM migration to meet SLA objectives. Estimating the resources needed to meet an appliction's SLA requires a model that inspects the request arrival rates for the application and infers its CPU, memory, and network bandwidth needs. Developing such models is not the focus of this work and has been addressed by several previous efforts such as [17,1]. Our work is to present a new model to migrate VMs be-tween cluster nodes using TOPSIS algorithm to make de-cision over whole active servers of the data center and find the most loaded server.We have implemented our techniques using the Xen virtual machine [3]. We conduct a detailed experimental evaluation on cluster servers us-ing a mix of CPU-, network- and memory-intensive ap-plications. Results show that our techniqe imposes some overheads rather than Sandpiper but it makes better mi-gration decisions when hotspot occurs. In order to introduce the suggested fuzzy decision mak-ing model based on TOPSIS, firstly, disadvantages of the existing method of ranking the servers will be presented. Then, the basic concepts of the technique for order prefe-rence by similarity to an ideal solution1 and the Fuzzy TOPSIS are mentioned. Moreover, the fuzzy decision making2 software tool is introduced. An application of the FDM software tool is carried out through case studies. The rest of this paper is structured as follows. Section 2 presents some existing methods of finding most loaded servers and their limitations, and Sections 3-6 present our designed algorithm. Section 7 presents our implementa-tion and evaluation. Finally, Sections 8 and 9 present background and related Work and our conclusions, re-spectively. B ACKGROUND AND R ELATED W ORK

Detecting workload hotspots and initiating a migration is currently handled manually. Manually-initiated migra-tion lacks the ability to respond quickly to sudden work-load changes. To address this challenge, studies automated strategies for virtual machine migration in large data centers. The proposed techniques automate the tasks of monitoring system resource usage, hotspot detection, determining a new mapping and initiating the necessary migrations. Migration is further complicated by the need to consider multiple resources-CPU, network,and memory-for each application and physical server [67]. TOPSIS FDM Sandpiper implements a hotspot detection algorithm that determines when to migrate virtual machines, and a hotspot mitigation algorithm that determines what and where to migrate [67]. Upon hotspot detection, Sandpi-per's migration manager is invoked for hotspot mitiga-tion. The migration manager employs provisioning tech-niques to determine the resource needs of overloaded VMs and uses a greedy algorithm to determine a se-quence of moves or swaps to migrate overloaded VMs to underloaded servers. The migration manager determines which VMs need to be migrated. The basic idea is to move load from the most overloaded servers to the least-overloaded servers, while attempting to minimize data copying incurred during migration. Since a VM or a server can be overloaded along one or more of three dimensions– CPU, network and memory–it defined a new metric that captures the combined CPU-network-memory load of a virtual and physical server. The volume of a physical or virtual server is defined as the product of its CPU, network and memory loads: (1) Where CPU, net and mem are the corresponding utiliza-tions of that resource for the virtual or physical server. The higher the utilization of a resource, the greater the volume; if multiple resources are heavily utilized the above product results in a correspondingly higher vo-lume. Unfotunately, In such numerical ranking methods, the influence of each parameter is verified separately and the mutual effects of parameters are ignored.Also in these , all the criteria are assumed to have equal weights in de-cision making, but considering the status of each parame-ter makes such an assumption unrealistic.Fore example, when the virtual machines are web server and because the Web servers are cpu-intensive load, the propoabilty of CPU saturation is more than RAM saturation. In the other word, the weight of CPU and RAM influence are not equal. VMware has added OS migration support, dubbed VMo-tion, to their VirtualCenter management software [69]. VMware Distributed Resource Scheduler improves re-source allocation across all hosts and resource pools in a cluster. When a cluster is enabled for DRS, VirtualCenter continuously monitors the distribution of CPU and mem-ory resource usage for all hosts and virtual machines in that cluster. DRS compares these metrics to ideal resource utilization—that is, the virtual machines’ entitlements. These entitlements are determined based on the resource policies of virtual machines in the cluster and their cur-rent demands. VirtualCenter uses this analysis to perform initial placement of virtual machines, virtual machine migration for load balancing, enforcement of rules and policies [68]. As the number of virtual machines and host increases, overhead imposed by balancing migration in-creases too. Moreover; the cluster which is setup to run DRS consists of a homogenous configuration of hosts. This assumption simplify the decision algorithm and re-duse the overhead. Another negative point of DRS is that DRS does not make virtual machine placement decisions based on their usage of I/O resources. VMware's DRS will use algorithms based on CPU and memory to decide how to balance the hosts. If some I/O-intensive work-loads share a single host, they might saturate the host’s I/O capacity, leading to performance degradation.In the other word, parameters which algorithm of sorting physi-cal node considered are restricted to CPU and RAM sta-tus. In addition,As this is commercial software and strict-ly disallows the publication of third-party benchmarks, we are only able to infer its behavior through VMware's own publications. These limitations make a thorough technical comparison impossible. However, based on the VirtualCenter User's Manual [66], their approach is gen-erally similar to XEN live migration [6] and would expect it to perform to a similar standard.

3 M

ETHODOLOGY AND THE PROPOSED MODEL

Considering limitations and disadvantages of existing method of calculating criticality of cluster servers, efforts have been made in order to develop decision making models for sorting physical nodes. The existing decision making models for server selection are useful but have restricted applications. These methods cannot deal with decision maker ambiguities, uncertain-ties and vagueness, which cannot be handled by crisp values. Having to use crisp values is one of the important problematic points in their process. In this article, the concept of the approach used for sorting problem is based on the fuzzy technique for order preference by si-milarity to ideal solution (Fuzzy TOPSIS). This is because four advantages are addressed: (1) a sound logic that represents the rationale of human choice, (2) a scalar val-ue that accounts for both the best and worst alternatives simultaneously, (3) a simple computation process that can be easily programmed, and (4) the performance measures of all alternatives on attributes can be visualized. These advantages make TOPSIS a major decision making tech-nique as compared with other related techniques such as AHP[26, 27]. The disadvantages of the AHP technique are that it focuses mainly on the decision maker who has to make many pair-wise comparisons to reach a decision, while possibly using subjective preferences. Furthermore, an important disadvantage of the AHP method is the ar-tificial limitation of the use of the nine-point scale. For instance, if Alternative A is five times more important than Alternative B, which in turn is five times more im-portant than Alternative C, a serious evaluation problem arises. The Saaty method[32] cannot cope with the fact that Alternative A is twenty five times more important than Alternative C[33]. The methodology is useful only when the decision making framework has a unidirection-al hierarchical relationship among decision levels. More-over, AHP is not practically usable if the number of alter-natives and criteria are large since the repetitive assess-ments may cause fatigue in the decision maker[34,35]. Our algorithm have a hierarchial stracture. A supervisor program is monitoring status of all nodes based on usage statistics gathered from various physical servers. Work-load of each node varies over time and may cross the threshold. So, to avoid server saturation and service down time the program check the physical host to find overloaded ones. Upon hotspot detection in a node. The level two program will be run. After finding the appro-priate VM from the most loaded node identified in level 1 program, The third program which is responsible to mi-grate VMs operates automatically.All of these programs encapsulated in a software package mplemented in DOM 0 of the control node. We assumed that the cluster is heterogeneous; therefore, lots of parameters like CPU clock speed, RAM capacity, RAM usage, NET usage, operating temperature… of hosts and virtual machines are used to make decision. Recent methods for decision making processes have enabled decision-makers to decide more quickly, easily and sensitively [30]. Therefore; we can apply them to con-stract a new model to choose the best virtual machine to be moved from overloaded to underloaded servers to better load balancing. The deterministic explanation in numerical ranking methods can be mentioned as the most important disadvantage, because in the method offered by Sandpiper, parameters are explained by crisp values. But some parameters can be linguistic statements. For example, for temperature of each cluster node, linguistic terms are better to be used. Therfore; we divided Parame-ters affecting on node ranking into three classes of crisp , linguistic and fuzzy parameters to add more flexibility in deciding. TOPSIS is a popular approach to the MCDM method and has been widely used in the literature (Abo-Sinna and Amer [37]; Agrawal et al.[38]; Cheng et al.[39]; Deng et al.[40]; Feng and Wang[41, 42]; Hwang and Yoon[43]; Jee and Kang[44]; Kim et al.[45]; Lai et al.[46]; Liao[47]; Ol-son[48]; Opricovic and Tzeng[49]; Parkan and Wu[50,51]; Tong and Su[52]; Tzeng et al.[53]; Zanakis et al.[54]). The method has also been extended to deal with Fuzzy MCDM problems. For example, Chen [55] first converted a fuzzy MCDM problem into a crisp one via centroid de-fuzzification and then solved the nonfuzzy MCDM prob-lem using that method. Chen and Tzeng [56] transformed a fuzzy MCDM problem into a nonfuzzy MCDM using a fuzzy integral. Instead of using distance, they employed the grey relation grade to define the relative closeness of each alternative. Chu [57, 58] and Chu and Lin [59] also changed a fuzzy MCDM problem into a crisp one and solved the crisp MCDM problem using the method. Dif-fering from the others, they first derived the membership functions of all the weighted rankings in a weighted nor-malization decision matrix using interval arithmetics of fuzzy numbers and then defuzzified them into crisp val-ues using the ranking method of mean of removals (Kaufmann and Gupta [60]). Chen[55] extended the method to fuzzy group decision making situations by defining a crisp Euclidean distance between any two fuzzy numbers. Triantaphyllou and Lin [61] developed a fuzzy version of the method based on fuzzy arithmetic operations, which led to a fuzzy relative closeness for each alternative proposed by Wang and El-hag [62]. The TOPSIS method is a technique for order preference by similarity to ideal solution and proposed by Hwang and Yoon [43]. The ideal solution (also called the positive ideal solution) is a solution that maximizes the benefit criteria/attributes and minimizes the cost crite-ria/attributes, whereas the negative ideal solution (also called the anti-ideal solution) maximizes the cost crite-ria/attributes and minimizes the benefit crite-ria/attributes. The so-called benefit criteria/attributes are those for maximization, while the cost criteria/attributes are those for minimization. The best alternative is the one that is closest to the ideal solution and farthest from the negative ideal solut ion. Suppose a MCDM problem with m alternatives, A1,...,Am, and n decision crite-ria/attributes, C1,...,Cn. Each alternative is evaluated with respect to m criteria/attributes. All the val-ues/ratings assigned to the alternatives with respect to each criterion form a decision matrix denoted by X = (xij) nxm. Let W = (w1,.., wn) be the relative weight vector for the criteria, satisfying Σ ni=1wi = 1, then the method can be summarized as follows[34] deterministic a) Calculate the decision matrix (D) as: (2) b) Calculate the normalized decision matrix or R matrix. The normalized value ij r is calculated as: (3) ∑ = = mi ijijij rrr (4) c) Calculate the criteria weighted matrix as: (5) d) Calculate the weighted normalized decision matrix. The weighted normalized value Σij is calculated as: (6) Where wj is the weight of the jth criterion and =1 e) Determine the positive ideal and negative ideal solu-tion, A+ and A– respectively. (7) (8) Where I is associated with benefit criteria, and J is asso-ciated with cost criteria. f) Calculate the separation measures, using the ndimen-sional Euclidean distance. The distance of each alternative from the ideal solution is given as: (9) Similarly, the distance from the negative ideal solution is given as: (10) g) Calculate the relative closeness to the ideal solution. The relative closeness of the alternative Aj with respect to A+ is defined as: (11) Since d– J ≥ 0 and d +J ≥ 0, then clearly RCj [0,1] . h) Rank the alternatives according to the relative close-ness to the ideal solution: the higher RCj, the better alter-native Aj. [62]. The fuzzy theory is a modern theory, which was pro-posed by Zadeh [63]. In classic logic, events have two values: to be or not to be, to exist or not to exist, black or white, and one or zero. But in fuzzy logic, in order to an-swer to events, a consistent spectrum is considered be-tween ‘to exist’ and ‘not to exist’ and world phenomena are seen as gray—neither black nor white. The use of fuzzy theory allows us to incorporate unquantifiable in-formation, incomplete information, non-obtainable in-formation, and partial facts into the decision model. The fuzzy decision matrix (D~) and criteria weighted (W ~) can be concisely expressed in matrix format as: (12) (13) where x~ij, i = (1,2,...,m), j = (1,2,...,n)and w ~ j, j = (1,2,...,n) are fuzzy numbers, x~ij = (aij, bij, cij) and w ~ j = (wj1, wj2, wj3). That x~ ij is the performance rating of the ith alternative, Ai, with respect to the jth criteria, Cj and w ~j represents the weight of the jth attribute, Cj. The nor-malized fuzzy decision matrix denoted by R~is shown as: (14) If x~ij = (aij, bij, cij), i = (1,2,...,m) and j = (1,2,...,n) are tri-angular fuzzy numbers, then the normalization process can be conducted by:[62] (15) (16) Where Σb and Σc are the sets of benefit criteria and cost criteria, respectively and c+ j = max I cij, j Σ Σb and a– j = min I aij, j Σ Σc. The weighted fuzzy normalized decision matrix is shown as: (17) The fuzzy positive-ideal (A+) and the fuzzy negativeideal (A–) solutions are shown as: (18) (19) The distance of each alternative from A+ and A– can be currently calculated using Equations [19] and [20]. (20) (21) If a~ = (a1, a2, a3) and b~= (b1, b2, b3) are two triangular fuzzy numbers, then the vertex method is used to calcu-late the distance between them and is calculated as: (22) At the end, the relative closeness of each alternative to the ideal solution is calculated as below: (23) RCi is then used to rank the alternatives. The higher the RCi, the higher criticality of physical server. The higher value of RCi indicates that an alternative is closer to the positive ideal solution and farther from the negative ideal solution simultaneously. A value of 1 (or 100 per cent) for an alternative indicates that the alternative is equal to the positive ideal solution and a value of 0 (or 0 per cent) is equal to the negative ideal solution. The best alternative is the one with the greatest relative closeness to the positive ideal solution. software tool The fuzzy decision making (FDM) software tool[55] has been prepared to make decisions onsidering specific crite-ria and the effect of qualitative parameters and in the sit-uation where the decision maker does not have access to precise information, by Meamareiani at the engineering faculty of Tarbiat-modares University in Iran. The FDM software tool has been designed based on the Fuzzy TOPSIS technique, presented in the previous section. In this section, it is attempted to remove the hotspot in exist-ing cluster nodes by using this software tool and applying it systematically. The most important advantage of apply-ing this software tool is its ability in cases where diversity of data exists .

5 A

NEW MODEL FOR FINDING MOST LOADED SERVER IN CLUSTER SERVER

So far, the problems with methods to sort physical server have been discussed. Now our technique applying fuzzy decision making software tool in the process of selecting the most overloaded server is presented. The main pur-pose of this paper is to present a fuzzy multi-criteria deci-sion making model to sort physical nodes from the most to the least loaded. The basic idea is to move load from the most overloaded servers to the least-overloaded servers, while attempting to minimize data copying incurred during migration. A server can be overloaded along one or more dimensions– CPU, network , memory, temperature and ….if multiple resources of a nodeare heavily utilized, the correspond-ing serever position is near to the top in the sorting table and results in a correspondingly higher score. The vo-lume captures the degree of (over)load along multiple dimensions in a weighted fashion and can be used by the mitigation algorithms to handle all resource hotspots. This paper presents a hierarchical process based on TOP-SIS that operated in two steps. In step one, all physical FDM nodes of the Cluster are being compared to each other. After applying decision algorithm, nodes are sorted fram the highest overloaded to the idle ones (if available). Some parameters used in this stage are shown in Table 1. Every node will be labled with a number between 0 and 1(0 means idle server and 1 indicates saturated server). If there is a node beaking the threshold we triy to mitigate its load by migration the most appropriate Virtual Ma-chine runing on that machine to the least loded node de-termined in the previous step. At the next level we come to the most critical node and apply the fuzzy decision algorithm for the second time. The result of this level will be finding the best VM candi-date for migration. In this step, we use a new set of para-meters that some of them are not exist in the first stage (Table2). T ABLE FIRST LEVEL PARAMETERS

For example, in Table I, the variety of the data that can be explained to illustrate the server condition has been of-fered. In tests, five parameters were used. The first one is the CPU usage of every node in percent.Because of as-suming the cluster heterologous,in addition to the CPU usage, the clock speed of the processors is important too. For more explanation, considering two hetrogenous1.8 GHZ and 3.2 GHZ physical nodes with two different CPU usage (fore example 60% and 75%), the fisrt one is more dangerous and occurring saturation is more possible al-though its utilization hs lower the second node. If compa-rision made overe the utilization of CPU parameter only, the second server hade been choosed. Therfore, to decide more efficient, both of parameters are necessary.These informations are deterministics and mentioned as percent and numbers. RAM utilization and RAM capacity are another criterias which have incredible role in server load.The next pair of parameters are Network utilization and server Network band width.

There are some methods for solving Multiple Criteria Decision-Making problems, of which one is the TOPSIS method.The principle behind TOPSIS is simple: The cho-sen alternative should be as close to the ideal solution as possible and as far from the negative-ideal solution as possible. The ideal solution is formed as a composite of the best performance values exhibited (in the decision matrix) by any alternative for each attribute. The nega-tive-ideal solution is the composite of the worst perfor-mance values. Proximity to each of these performance poles is measured in the Euclidean sense (e.g., square root of the sum of the squared distances along each axis in the "attribute space"), with optional weighting of each ttriute.

TOPSIS algorithm can receive three types of information, including deterministic, linguistic, and fuzzy information. These three types of data are indeed parameters affecting the decision making process for selecting the overloaded servers. But in previous methods only the crisp (ordinary) values constitute the decision making process input. Number of VM in every host is a linguistic parameter. Specification of each node differ from others. It means that the number of virtual machine that can handle is dif-ferent. For example 4 virtual machines do not impose an annoying load on a specified server while over another weak node available on the cluster can not be han-deled.Therefore, we mentione this parameter linguistical-ly. In the FDM software tool, the linguistic variables di-vided to seven-levels, ‘very low’, ‘low’, ‘more or less (MoL) low’, ‘medium’, ‘more or less (MoL) high’, ‘high’ and ‘very high’. To apply mathematical formula ove lin-guistic statement, we need to map them with numbers. On way is presented by Saaty. His technique is in table5. In this table numbers 9 is assigned to VH and 1 to VL. Other values are between these two numbers. To change these description to value Based on these as-sumptions, a transformation table can be created as shown in Table 4.

N Name Data Type Type Weight Description

1 CPU% Deterministic Benefit VH CPU usage 2 RAM % Deterministic Benefit ML RAM usage 3 NET % Deterministic Benefit ML NET usage 4 T ABLE SECOND LEVEL PARAMETERS

N Name Data Type Type Weight Description

1 CPU% Deterministic Benefit VH CPU usage by VM 2 RAM % Deterministic Benefit ML RAM usage by VM 3 NET % Deterministic Benefit ML BW usage by VM 4 RAM usage Deterministic Cost H RAM used by VM(GB) 5 QoS Linguistic Benefit H Quality of Service for VM Physical host temperature is one of the parameters used to make better decision. Although there is a relationship between load of a server and its parameter, sometimes changing in temperature with high amplitude is asigh of failure in the system. If the cooling component is down or works poorly the temperature of the node rising sudden-ly and as a concequense server may failed.So, this pare-meter can be used as a predicting failure mechanisim. On the other hand, operating temperature varies from one sever to another depending on the hardware configura-tion.As a result, it is agood idea to describe this parameter as a Fuzzy statement. Regarding fuzzy numbers related to the temperature of every active node are shown in Table III. it should be added that in our experiments, tempera-ture of every host sremeasure in kelvin per and a and b values are left and right limit of the main value of tem-perature. For the Fuzzy parameter, we use Triangular Fuzzy Number. The decision making process in a fuzzy environment is the same as the decision making process in the human brain, because in everyday life, people ana-lyses much inaccurate fuzzy information and then makes decisions. To more flexibility of our algorithm, we defined a para-meter named QoS. It stands for quality of service and in-dicates the importance of each virtual machine. In the other word more QoS degree for a VM forced the migra-tion control unit to more pay attention to. If two VM have the same condition, VM with high QoS will be mi-grated to use more resources on the destination node. Tis parameter is linguistic. T ABLE

RANSFORMATIN FOR F UZZY MEMBERSHIP FUNCTION

Rank Abbreviated Membership function

Very Low VL (30,0,10) Low L (40,10,10) Mol Low ML (50,10,10) Medium M (60,10,10) Mol High MH (70,10,10) High H (80,10,10) Very High VH (90,10,0) T ABLE TRANSFORMATION TABLE

Rank Number

Very Low 1 Low 3 Mol Low 4 Medium 5 Mol High 6 High 7 Very High 9

6 I

MPLEMENTATION AND E VALUATION

To implement our proposed technique, a cluster server with five nodes were applied.Each node has some virtual machines. As Table 5 indicates we have totally 12 virtual machine and for initiation assign RAM to each one. The virtual machines run a mix of applications ranging from Apache and streaming servers to PHP, and MySQL. We run RUBiS on our servers--RUBiS is an open-source mul-ti-tier web application. When we start running the VMs over five node of our cluster, workload of each server varies over time. To see the result of our migration algo-rithm, workload of VMs are set to have their peak in pre-defined time(see Table 5). Fore example PM2 has one vir-tual machine which is idle so there is no peak in its work-load while in PM3 there are four VMs named VM1, VM2, VM3 and VM4 that VM2 is constant load. On the other word, VM1, VM3, and VM4 see their peaks on 450, 400, and 550 second after the cluster starts working respective-ly. Control unit runs every 3 minutes because some of the workloads change quickly. Depends on the VMs nature this interval can be set. Information from all nodes come to the control unit and verify every three minutes to see every hotspot. After inserting information of nodes in to the algorithm(Fig.1). Decision technique tries to sort serv-ers from high to low score. In order to remove a dimen-sion, the decision matrix is normalized and calculated using weighted normalized ratings automatically.The next action is to find the negative as well as the positive ideal solutions. After finding the ideal and negative solu-tions, the distance of each alternative is obtained in an n-dimensional space (n is the number of criteria affecting decision making). T ABLE

ODES OF CLUSTER UNDER TEST AND CORRESPONDING VIR-TUAL MACHINES

PM VM Predefined RAM(MB) Peak time(s) PM1 VM 5 256 10 VM6 256 ------------ PM2 VM7 128 ------------ PM3 VM1 512 450 VM2 128 ------------ VM3 128 400 VM4 256 550 PM4 VM8 128 100 VM9 256 ------------ VM10 128 235 PM5 VM11 128 134 VM12 256 ------------

Fig.1.Fuzzy Decision Making(stage 1)

The final scores of each parameter is its relative closeness to the positive ideal solution. These processes are per-formed by the algorithm tool and the user enters only the input information such as selection criteria, their effective weight and selection alternatives.As figure 2 shows the physical server 3 has obtained the highest score.Then con-trol unit verifies if this volume braeks the threshold de-fine by the administrator or not.If crossing accures It means that this node is in danger and migration is inevit-able. During our test, the threshold set to 75. In example shows in figure 1. Physical node 3 has been overloaded and breaks the threshold. Therefore; the output of this program is a list of server which PM3 is at top and PM2 is the last entry(Fig.2). In next stage another program runs over a different set of parameter to find the VM causing hotspot in PM3(see Fig.3).

Fig.2.Results of first stage

Fig.3.Finding VM to migrate(stage 2) RAM usage is the parameter that has Cost Type, concequently the TOPSIS algorithm find the VM which has the lowest RAM usage to avoid of transferring larg data over cluster. Result of the second program identified VM3 as the best candidate to migrate(Fig.4). Fig.4.Result of stage 2 If we do not use algorithm to balance the load of the clus-ter, some nodes will down and services on them stop working. Fore example, node PM3 see bottleneck after 400 seconds. Load of node PM3 has spike at 400, 450, and 550 seconds. Without applying a technique to decrease of load over PM3, this node will be overloaded. Fig.5 indi-cates this situation. Moreover; Workload of every nodes over time ha been showed there. Fig.5.PMs load over time and the threshold

Using load balancing control unit we can detedct hotspot on PM3 and via migrate VM3 from PM3 to PM2 (the least loaded server) the load will be distributed better and re-sponse time of the cluster improved(see Fig.6). In this figure, load of PM3 by reloacating VM3 decreased to about 60% and the hot spot was eliminated. On the other hand. Workload on PM2 increased because it hosted VM3. If sufficient resources are not available, then the algo-rithm examines the next least loaded server and so on, until a match is found for the candidate VM. If no physi-cal server can house the selected VM, then the algorithm moves on to the next VM and attempts to move it in a similar fashion. The process repeats until the utilizations of all resources on the physical server fall below the thre-sholds.

Fig.6.Eliminating hot spot of node PM3 A LGORITHM O VERHEAD

Although the disadvantages of numerical ranking me-thods have been removed partly in the decision making models offered, this methods have its own limits. In deci-sion making models, which are based on multicriteria decision making techniques, there is no limitation on the number of criteria and alternatives, but these models face the problem of time-consuming calculations(Fig.7). This figure illustrates that complexity of TOPSIS algorithm increases whe the number of VMs run on the cluster in-crese. Reducing the migration overhead (i.e., the amount of data transferred) is important, since Xen’s live migra-tion mechanism works by iteratively copying the memory image of the VM to the destination while keeping track of which pages are being dirtied and need to be resent. This requires Xen to intercept all memory accesses for the mi-grating domain, which significantly impacts the perfor-mance of the application inside the VM. By reducing the amount of data copied over the network, we can minim-ize the total migration time, and thus, the performance impact on applications. Note that network bandwidth available for application use is also reduced due to the background copying during migrations. In sandpiper[67], to determine which VMs to migrate, the algorithm orders

Fig.7.Time used to run TOPSIS algorithm when number of virtual machines increase physical servers in decreasing order of their volumes. Within each server, VMs are considered in decreasing order of their volume-to-size ratio (VSR); where VSR is defined as Volume/Size; size is the memory footprint of the VM. By considering VMs in VSR order, the algorithm attempts to migrate the maximum load per unit byte moved, which has been shown to minimize migration overhead [20]. In our algorithm, to minimize the migrat-ing data, another property of TOPSIS technique has been applied.For more illustration, with referring to Table1,we can see that there is a column in the table that labled with Type. This column gets two word, Benefit or Cost. In the other word, when we lable a parameter as Cost,it means that we want to select the Alternative which has this pa-rameter as less possible as. For the Benefit, it is vise versa. As shown in Figure 3, Type of parameter RAM usage is cost because we want to choose a VM from all virtual ma-chines run simultaneously on PM3 that has more CPU%,RAM %, NET %,QoS quantity but less RAM usage. C ONCLUSIONS A ND F UTURE W ORK

The hot node sorting of virtualized cluster servers is a critical point and strategic issue in the migratin process. This decision involves many parameters that are interre-lated in that changes in some parameters affect the others. This paper has discussed cluster node selection in a fuzzy environment and uncertain linguistic value of variables. Fuzzy TOPSIS is a viable method for the proposed prob-lem and is suitable for the use of linguistic variables. When the decision making condition is vague and inaccu-rate, then this method is the preferred technique. The present study explored the use of Fuzzy TOPSIS in find-ing the most critical server and the least one. The proposed model can be a suitable tool to rank serv-ers. A real case was studied and Fyzzy selection algo-rithm applied. The systematic evaluation by Fuzzy TOPSIS of machine selection problems can reduce the risk of a poor choice. The Fuzzy TOPSIS is one of the compensatory decision making methods. As mentioned before, in this method decreasing the score of one parameter is compensated by increasing the score of other parameter(s) and vice versa. Moreover, there is no limitation on the number of alterna-tives and criteria. By applying the FDM model, based on Fuzzy TOPSIS, a strategy was offered to extract a list of server from the most to the least loaded. This strategy has advantages in comparison with previous numerical rank-ing (scoring) methods such as Sandpiper. These advan-tages are a strong theoretical base on fuzzy logic, the abil-ity of sensitivity analysis, the direct usage of linguistic variables in the selection process, unlimited alternatives and criteria, and, most important of all, the possibility of considering the mutual effects of different parameters in the selection process. In fact, TOPSIS is one of the com-pensatory multiattribute decision making models. More-over, this model considers the uncertainty associated with the input parameters (linguistic variables) used in the selection process. A CKNOWLEDGMENT

The authors wish to thank Mis Zahra Palizdar for her contribution in developing this article. R EFERENCES [1]

K. Appleby, S. Fakhouri, L. Fong, M. Goldszmidt, S. Krishnakumar, D. Pazel, J. Pershing, and B. Rochwerger. Oceano - sla-based manage-ment of a computing utility. In Proc. IFIP/IEEE Symposium on Inte-grated Management, May 2001. [2]

M. Aron, P. Druschel, and W. Zwaenepoel. Cluster reserves: A mechan-ism for resource management in cluster-based network servers. In Proc. ACM SIGMETRICS '00. [3]

P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. Xen and the art of virtualiza-tion. In Proc. SOSP'03, pages 164-177, October 2003. [4]

G. P. Box, G. M. Jenkins, and G. C. Reinsel. Time Series Analysis Fore-casting and Control Third Edition. Prentice Hall, 1994. [5]

J. Chase, D. Anderson, P. Thakar, A. Vahdat, and R. Doyle. Managing energy and server resources in hosting centers. In Proc. SOSP '01. [6]

C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, andA. Warfield. Live migration of virtual machines. In Proc. NSDI '05, May 2005. [7]

K. Govil, D. Teodosiu, Y. Huang, and M. Rosenblum. Cellular disco: Resource management using virtual clusters on shared-memorymultiprocessors. In Proc. SOSP'99, pages 154-169, December 1999. [8]

L. Grit, D. Irwin, A. Yumerefendi, and J. Chase. Virtual Machine Host-ing for Networked Clusters: Building the Foundations for Autonomic Orchestration. In Proc. VTDC '06. [9]

D. Gupta, L. Cherkasova, R. Gardner, and A. Vahdat. Enforcing Per-formance Isolation across Virtual Machines in Xen. In Proc. Middle-ware '06, October, 2006. [10]

Michael L. Powell and Barton P. Miller. Process migration in DE-MOS/MP. In Proceedings of the ninth ACM Symposium on Operating System Principles, pages 110.119. ACM Press, 1983. [11]

Marvin M. Theimer, Keith A. Lantz, and David R. Cheriton. Preempta-ble remote execution facilities for the V-system. In Proceedings of the tenth ACM Symposium on Operating System Principles, pages 2.12. ACM Press, 1985. [12]

Eric Jul, Henry Levy, Norman Hutchinson, and Andrew Black. Fine-grained mobility in the emerald system. ACM Trans. Comput. Syst., 6(1):109.133, 1988. [13]

Fred Douglis and John K. Ousterhout. Transparent process migration: Design alternatives and the Sprite implementation. Software - Practice and Experience, 21(8):757.785, 1991. [14]

A. Barak and O. La'adan. The MOSIX multicomputer operating system for high performance cluster computing. Journal of Future Generation Computer Systems, 13(4-5):361.372 March 1998. [15]

D. Gupta, R. Gardner, and L. Cherkasova. Xenmon: Qos monitoring and performance profiling tool. Technical Report HPL-2005-187, HP Labs, 2005. [16]

S. Jones, A. Arpaci-Dusseau, and R. Arpaci-Dusseau. Geiger: Monitor-ing the buffer cache in a virtual machine environment. In Proc. AS-PLOS'06, pages 13-23, October 2006. [17]

A. Kamra, V. Misra, and E. Nahum. Yaksha: A self-tuning controller for managing the performance of 3-tiered web sites. In Proc. IWQoS '04, June 2004. [18]

L. Kleinrock. Queueing Systems, Volume 2: Computer Applications. John Wiley and Sons, Inc., 1976. [19]

D. Menasce and M. Bennani. Autonomic Virtualized Environments. In IEEE ICAS 06. [20]

M. Nelson, B. Lim, and G. Hutchins. Fast Transparent Migration for Virtual Machines. In Proc. USENIX 2005. [21]

S. Osman, D. Subhraveti, G. Su, and J. Nieh. The design and implemen-tation of zap: A system for migrating computing environments, In Proc. OSDI 2002. [22]

S. Ranjan, J. Rolia, H. Fu, and E. Knightly. Qos-driven server migration for internet data centers. In Proc. IWQoS 2002. [23]

P. Ruth, J. Rhee, D. Xu, R. Kennell, and S. Goasguen. Autonomic Live Adaptation of Virtual Computational Environments in a Multi-Domain Infrastructure. In Proc. IEEE ICAC '06. [24]

C. Sapuntzakis, R. Chandra, B. Pfaff, J. Chow, M. Lam, and M. Rosenblum. Optimizing the migration of virtual computers. In Proc. OSDI '02. [25]

V. Sundaram, T. Wood, and P. Shenoy. Efficient Data Migration in Self-managing Storage Systems. In Proc. ICAC '06. [26]

A. Sundararaj, A. Gupta, and P. Dinda. Increasing Application Perfor-mance in Virtual Environments through Run-time Inference and Adap-tation. In Proc. HPDC '05. [27] M. M. Theimer, K. A. L., and D. R. Cheriton. Preemptable Remote Execution Facilities for the V-System. In Proc. SOSP December 1985. [28]

B. Urgaonkar, P. Shenoy, A. Chandra, and P. Goyal. Dynamic provi-sioning for multi-tier internet applications. In Proc. ICAC '05, June 2005. [29]

KARADOGAN, A., KAHRIMAN, A., and OZER, U. Application of fuzzy set theory in the selection of underground mining method. The Journal of the South African Institute of Mining and Metallurgy, 2008. vol. 108. pp. 73–79. [31]

BELLMAN, R.E. and ZADEH, L.A. Decision Making In a Fuzzy Envi-ronment, Management Science, vol. 17, 1970. pp. 141–164. [32]

SAATY, T.L. Decision-making for Leaders, RWS Publication, USA. 1990. [33]

MACHARIS, C., SPRINGAEL, J., DE BRUCKER, K., and VERBEKE, A. PROMETHEE and AHP: The design of operational synergies in multi-criteria analysis, Strengthening PROMETHEE with ideas of AHP. Eu-ropean Journal of operational Research, vol. 153, 2004. pp. 307–317. [34]

SHYUR, H.J. Cost Evaluation Using Modified TOPSIS and ANP. Ap-pliedmathematics and computation, vol. 177, 2006. pp. 251–259. [35]

SHYUR, H.J. and SHIH, H.S. A hybrid MCDM model for Strategic vendor selection. Mathematical and Computer Modeling, vol. 44, 2006. pp. 749–761. [36]

SHIH, H.S., SHYUR, H.J., and LEE, E.S. An extension of TOPSIS for Group Decision Making. Mathematical and Computer Modeling, vol. 45, 2007. pp. 801–813. [37]

ABO-SINNA, M.A. and AMER, A.H. Extensions of for multi-objective largescale nonlinear programming problems. Applied Mathematics and Computation, vol. 162, 2005. pp. 243–256. [38]

AGRAWAL, V.P., KOHLI, V., and GUPTA, S. Computer aided robot selection: The multiple attribute decision making approach. Interna-tional Journal of Production Research, vol. 29, 1991. pp. 1629–1644. [39]

CHENG, S., CHAN, C.W., and HUANG, G.H. An integrated multi-criteria decision analysis and inexact mixed integer linear programming approach for solid waste management. Engineering Applications of Ar-tificial Intelligence, vol. 16, 2003. pp. 543–554. [40]

DENG, H., YEH, C.H., and WILLIS, R.J. Inter-company comparison using modified with objective weights. Computers and Operations Re-search, vol. 27, 2000. pp. 963–973. [41]

FENG, C.M. and WANG, R.T. Performance evaluation for airlines including the consideration of financial ratios. Journal of Air Transport Management, vol. 6, 2000. pp. 133–142. [42]

FENG, C.M. and WANG, R.T. considering the financial ratios on the performance evaluation of highway bus industry. Transport Reviews, vol. 21, 2001. pp. 449–467. [43]

HWANG, C.L. and YOON, K. Multiple attribute decision making: Methods and applications. Berlin: Springer. 1981 [44]

JEE, D.H. AND KANG, K.J. A method for optimal material selection aided with decision making theory. Materials and Design, vol. 21, 2000. pp. 199–206. [45]

KIM, G., PARK, C.S., and YOON, K.P. Identifying investment oppor-tunities for advanced manufacturing systems with comparative-integrated performance measurement. International Journal of Produc-tion Economics, vol. 50, 1997. pp. 23–33. [46]

LAI, Y.J., LIU, T.Y., and HWANG, C.L. For MODM. European Journal of Operational Research, vol. 76, 1994. pp. 486–500. [47]

LIAO, H.C. Using PCR- to optimize Taguchi’s multi-response problem. The International Journal of Advanced Manufacturing Technology, vol. 22, 2003. pp. 649–655. [48]

OLSON, D.L. Comparison of weights in models. Mathematical and Computer Modelling, vol. 40, 2004. pp. 721–727. [49]

OPRICOVIC, S., and TZENG, G.H. Compromise solution by MCDM methods: A comparative analysis of VIKOR. European Journal of Op-erational Research, vol. 156, 2004. pp. 445–455. [50]

PARKAN, C. and WU, M.L. On the equivalence of operational perfor-mance measurement and multiple attribute decision making. Interna-tional Journal of Production Research, vol. 35, 1997. pp. 2963–2988. [51]

PARKAN, C. AND WU, M.L. Decision-making and performance measurement models with applications to robot selection. Computers and Industrial Engineering, vol. 36, 1999. pp. 503–523. [52]

TONG, L.I. and SU, C.T. Optimizing multi-response problems in the Taguchi method by fuzzy multiple attribute decision making. Quality and Reliability Engineering International, vol. 13, 1997. pp. 25–34. [53]

TZENG, G.H., LIN, C.W., and OPRICOVIC, S. Multi-criteria analysis of alternative-fuel buses for public transportation. Energy Policy, vol. 33, 2005. pp. 1373–1383. [54]

ZANAKIS, S.H., SOLOMON, A., WISHART, N., and DUBLISH, S. Multi-attribute decision making: A simulation comparison of select me-thods. European Journal of Operational Research, vol. 107, 1998. pp. 507–529. [55]

CHEN, C.T. Extension of the for group decision-making under fuzzy environment. Fuzzy Sets and Systems, 2000. pp. 114, 1–9. [56]

CHEN, M.F. and TZENG, G.H. Combining grey relation and concepts for selecting an expatriate host country. Mathematical and Computer Modelling, vol. 40, 2004. pp. 1473–1490. [57]

Chu, T. C. Facility location selection using fuzzy TOPSIS under group decisions. International Journal of Uncertainty, Fuzziness and Know-ledge-Based Systems, 2002. Vol. 10, Pp. 687–701. [58]

CHU, T.C. Selecting plant location via a fuzzy TOPSIS approach. The International Journal of Advanced Manufacturing Technology, 2002, vol. 20, pp. 859–864. [59]

CHU, T.C. and LIN, Y.C. A fuzzy TOPSIS method for robot selection. The International Journal of Advanced Manufacturing Technology, vol. 21, 2003. pp. 284–290. [60]

KAUFMANN, A. and GUPTA, M.M. Introduction to fuzzy arithmetic: Theory and applications. New York: VanNostrand-Reinhold. 1991 [61]

TRIANTAPHYLLOU, E. and LIN, C.T. Development and evaluation of five fuzzy multiattribute decision-making methods. International Journal of Approximate Reasoning, vol. 14, 1996. pp. 281–310. [62]

YING-MING WANG and TAHA, M.S. Elhag. Fuzzy method based on alpha level sets with an application to bridge risk assessment. Expert systems with applications, 2005. pp. 1–11. [63]

ZADEH, L.A. Fuzzy sets. Information control. vol. 8, 1965. pp. 338–353. [64]

MEAMARIANI, A. FDM software (Fuzzy Decision Meaking). Tarbiat Modares University. Tehran, Iran. 2003. [65]

D. Milojicic, F. Douglis, Y. Paindaveine, R. Wheeler,and S. Zhou. Process migration. ACM Computing Surveys, 32(3):241.299, 2000. [66]

VMWare, Inc. VMWare VirtualCenter Version 1.2 User's Manual. 2004. [67]

Black-box and Gray-box Strategies for Virtual Machine Migration. Timothy Wood, Prashant Shenoy, Arun Venkataramani, and Mazin Yousif. To appear in Computer Networks Journal Special Issue on Vir-tualized Data Centers 2009. (Extended version of NSDI 07 paper) [68]

VMWare, Inc. DRS Performance and Best Practices, 2008 [69]

VMWare, Inc. drs_datasheet.pdf

M.Tarighi is now PHd student in Tehrn Polytechnique University. He graduated from SHAHID SHAMRAN university of AHWAZ in BSc of Electronics & from Amir Kabir University of technology in MSc of Electronics. His interests are currently Cluster Computing, Virtualiza-tion, and Decision Algorithms.Beside that, soccer, Internet, Volunteer works, chess,is now PHd student in Tehrn Polytechnique University. He graduated from SHAHID SHAMRAN university of AHWAZ in BSc of Electronics & from Amir Kabir University of technology in MSc of Electronics. His interests are currently Cluster Computing, Virtualiza-tion, and Decision Algorithms.Beside that, soccer, Internet, Volunteer works, chess,