
Riccardo LANCELLOTTI

Associate Professor
Department of Engineering "Enzo Ferrari"




Publications

2024 - An Optimization-Based Decision Support System for Multi-trip Vehicle Routing Problems [Journal article]
Cavecchia, Mirko; Alves de Queiroz, Thiago; Iori, Manuel; Lancellotti, Riccardo; Zucchi, Giorgio

Decision support systems (DSS) are used daily to make complex and hard decisions. Developing a DSS is not an easy task and may require combining different approaches to reach accurate and timely responses. In this paper, we present a DSS based on a micro-service architecture that we developed to handle a variant of the vehicle routing problem. The DSS has been implemented for a service company operating in the field of pharmaceutical distribution, and it helps decision-makers define the routes that different types of vehicles need to perform during the day to serve the customers’ demands. The underlying optimization problem assumes that a vehicle can perform multiple routes daily and is constrained to operate within a given time horizon. Customers are characterized by hard time windows on the delivery times. The proposed DSS first handles geo-referencing and distance calculation tasks. Then, it invokes a two-step optimization approach in which vehicle routes are generated and combined to reduce the number of vehicles used. For the latter task, we propose and evaluate four solution methods: two greedy heuristics, a metaheuristic, and a mathematical model. All the methods are applied to solve real and randomly generated instances, showing that the metaheuristic algorithm is superior to the others in terms of solution quality and computing time. The company provided very positive feedback on the proposed DSS and is now using it to support its daily operations.
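
To make the route-combination step concrete, the following is a minimal Python sketch of a greedy heuristic of the kind evaluated in the paper: routes, each with a duration and a feasible start window derived from customer time windows, are packed onto as few vehicles as possible within the working horizon. The Route fields and the numeric values are hypothetical; this is not the paper's actual implementation.

    from dataclasses import dataclass

    @dataclass
    class Route:
        duration: float        # total driving plus service time
        earliest_start: float  # derived from customer time windows
        latest_start: float

    def combine_routes(routes, horizon):
        """Greedily assign each route to the first vehicle that can still perform it."""
        vehicles = []  # for each vehicle, the time at which it becomes free again
        for r in sorted(routes, key=lambda r: r.earliest_start):
            for i, free_at in enumerate(vehicles):
                start = max(free_at, r.earliest_start)
                if start <= r.latest_start and start + r.duration <= horizon:
                    vehicles[i] = start + r.duration
                    break
            else:  # no existing vehicle fits: use a new one
                vehicles.append(r.earliest_start + r.duration)
        return len(vehicles)

    routes = [Route(120, 0, 60), Route(90, 100, 240), Route(150, 0, 30)]
    print(combine_routes(routes, horizon=480))  # vehicles used, here 2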


2023 - A Decision Support System for Multi-Trip Vehicle Routing Problems [Conference paper]
Cavecchia, Mirko; Alves de Queiroz, Thiago; Iori, Manuel; Lancellotti, Riccardo; Zucchi, Giorgio

Emerging trends, driven by Industry 4.0 and Big Data, are pushing to combine optimization techniques with Decision Support Systems (DSS). The use of a DSS can reduce the decision-maker's uncertainty regarding the economic feasibility of a project and its technical design. Designing a DSS can be very hard, due to the inherent complexity of these types of systems; monolithic software architectures are therefore not a viable solution. This paper describes the DSS developed for an Italian company based on a micro-services architecture. In particular, the services handle geo-referenced information to solve a multi-trip vehicle routing problem with time windows. To face the problem, we follow a two-step approach. First, we generate a set of routes by solving a vehicle routing problem with time windows using a metaheuristic algorithm. Second, we calculate the interval in which each route can start and end, and then combine the routes, with an integer linear programming model, to minimize the number of used vehicles. Computational tests are conducted on real and random instances and prove the efficiency of the approach.
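
As an illustration of the second step, here is a hedged sketch of a compact integer linear program, written with the PuLP library (which the paper does not necessarily use): routes are assigned to vehicles so as to minimize the number of vehicles used. For brevity only pairwise time compatibility between routes on the same vehicle is enforced, a simplification of the full model; all data are illustrative.

    from pulp import LpProblem, LpMinimize, LpVariable, lpSum, LpBinary

    # (earliest_start, latest_start, duration) per route -- hypothetical values
    routes = [(0, 60, 120), (100, 240, 90), (0, 30, 150)]
    R = range(len(routes))
    V = range(len(routes))  # never more vehicles than routes

    def compatible(r, s):
        er, lr, dr = routes[r]
        es, ls, ds = routes[s]
        return er + dr <= ls or es + ds <= lr  # one route can precede the other

    prob = LpProblem("route_combination", LpMinimize)
    x = {(r, v): LpVariable(f"x_{r}_{v}", cat=LpBinary) for r in R for v in V}
    y = {v: LpVariable(f"y_{v}", cat=LpBinary) for v in V}
    prob += lpSum(y[v] for v in V)                 # minimize the vehicles used
    for r in R:
        prob += lpSum(x[r, v] for v in V) == 1    # each route on one vehicle
        for v in V:
            prob += x[r, v] <= y[v]               # link assignment and usage
    for r in R:
        for s in R:
            if r < s and not compatible(r, s):
                for v in V:
                    prob += x[r, v] + x[s, v] <= 1  # keep incompatible routes apart
    prob.solve()
    print(sum(int(y[v].value()) for v in V))       # minimum number of vehicles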


2023 - A Validated Performance Model for Micro-services Placement in Fog Systems [Journal article]
Canali, C.; Di Modica, G.; Lancellotti, R.; Rossi, S.; Scotece, D.

The recent evolutionary trend of modern applications is towards a development paradigm that involves the composition of multiple interconnected micro-services devoted to performing specific functions. Such applications usually rely on data collected by geographically distributed sensors or by mobile users and are often characterized by strict requirements in terms of latency and response time. These requirements may not be compatible with the traditional cloud computing approach, where the computation occurring on far-away data centers cannot always guarantee the satisfaction of latency constraints. The fog computing approach has recently received a lot of attention as a promising solution for supporting time-critical applications. Thanks to an intermediate layer of fog nodes located close to sensors or final users and able to process the application data, fog systems may significantly reduce the experienced response time. In a scenario where applications are composed of a chain of multiple micro-services, however, the service placement over the nodes of the fog infrastructure represents a nontrivial issue with respect to the cloud computing context. The highly distributed and heterogeneous nature of the fog nodes requires novel solutions taking into account the different performance of the fog nodes and the network delays caused by inter-node connectivity. This paper proposes a performance model for the placement of application micro-services over the fog infrastructure. To face the computational complexity of the optimization model, a heuristic based on a genetic algorithm is proposed. Furthermore, the analytical model is validated by means of simulation. The performance of the proposed solution is evaluated under a wide set of scenarios and parameter ranges, including a case study based on realistic micro-services characterized through a prototype implementation.
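
For intuition on the kind of performance model involved, the sketch below estimates the response time of a micro-service chain placed on fog nodes, combining an M/M/1 queueing term per node with the network delay between consecutive services. It is only a schematic reading of the abstract, not the paper's model; node names, rates and delays are hypothetical.

    def chain_response_time(placement, service_rate, arrival_rate, net_delay):
        """placement[i] = fog node hosting micro-service i of the chain."""
        t = 0.0
        for i, node in enumerate(placement):
            mu, lam = service_rate[node], arrival_rate[node]
            assert lam < mu, "node overloaded"
            t += 1.0 / (mu - lam)  # M/M/1 mean residence time on the node
            if i + 1 < len(placement):
                t += net_delay[(node, placement[i + 1])]  # hop to next service
        return t

    service_rate = {"f1": 50.0, "f2": 80.0}  # jobs/s each node can process
    arrival_rate = {"f1": 30.0, "f2": 30.0}  # jobs/s arriving at each node
    net_delay = {("f1", "f2"): 0.005, ("f2", "f1"): 0.005,
                 ("f1", "f1"): 0.0, ("f2", "f2"): 0.0}  # seconds per hop
    print(chain_response_time(["f1", "f2"], service_rate, arrival_rate, net_delay))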


2023 - Placement of IoT Microservices in Fog Computing Systems: A Comparison of Heuristics [Journal article]
Canali, C.; Gazzotti, C.; Lancellotti, R.; Schena, F.

In the last few years, fog computing has been recognized as a promising approach to support modern IoT applications based on microservices. The main characteristic of these applications involves the presence of geographically distributed sensors or mobile end users acting as sources of data. Relying on a cloud computing approach may not represent the most suitable solution in these scenarios due to the non-negligible latency between data sources and distant cloud data centers, which may represent an issue in cases involving real-time and latency-sensitive IoT applications. Placing certain tasks, such as preprocessing or data aggregation, in a layer of fog nodes close to sensors or end users may help to decrease the response time of IoT applications as well as the traffic towards the cloud data centers. However, the fog scenario is characterized by a much more complex and heterogeneous infrastructure compared to a cloud data center, where the computing nodes and the inter-node connections are more homogeneous. As a consequence, the problem of efficiently placing microservices over distributed fog nodes requires novel and efficient solutions. In this paper, we address this issue by proposing and comparing different heuristics for placing the application microservices over the nodes of a fog infrastructure. We test the performance of the proposed heuristics and their ability to minimize application response times and satisfy the Service Level Agreement across a wide set of operating conditions, in order to understand which approach performs best depending on the IoT application scenario.
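
A minimal example of one such heuristic, under assumptions of ours rather than the paper's: each micro-service is placed, heaviest first, on the fog node that yields the lowest estimated M/M/1 processing delay given the load already assigned. Service names, demands and capacities are hypothetical.

    def greedy_placement(services, nodes):
        """services: list of (name, demand in jobs/s); nodes: name -> capacity."""
        load = {n: 0.0 for n in nodes}
        placement = {}
        for name, demand in sorted(services, key=lambda s: -s[1]):  # heaviest first
            def delay(n):
                if load[n] + demand >= nodes[n]:
                    return float("inf")  # placing here would overload the node
                return 1.0 / (nodes[n] - load[n] - demand)  # M/M/1 estimate
            best = min(nodes, key=delay)
            placement[name] = best
            load[best] += demand
        return placement

    services = [("preproc", 40.0), ("aggregate", 25.0), ("notify", 10.0)]
    nodes = {"fog-a": 60.0, "fog-b": 50.0}
    print(greedy_placement(services, nodes))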


2022 - An Optimization View to the Design of Edge Computing Infrastructures for IoT Applications [Book chapter]
de Queiroz, Thiago Alves; Canali, Claudia; Iori, Manuel; Lancellotti, Riccardo

Internet of Things (IoT) based applications have recently experienced a remarkable diffusion in many different contexts, such as automotive, e-health, public security, industrial applications, energy, and waste management. These kinds of applications are characterized by geographically distributed sensors that collect data to be processed through algorithms of Artificial Intelligence (AI). Due to the vast amount of data to be processed by AI algorithms and the severe latency requirements of some applications, the emerging Edge Computing paradigm may represent the preferable choice for the supporting infrastructure. However, the design of edge computing infrastructures opens several new issues concerning the allocation of data flows coming from sensors over the edge nodes, and the choice of the number and the location of the edge nodes to be activated. The service placement issue can be modeled through a multi-objective optimization aiming at minimizing two aspects: the response time for data transmission and processing in the sensors-edge-cloud path, and the (energy or monetary) cost related to the number of turned-on edge nodes. Two heuristics, based on Variable Neighborhood Search and on Genetic Algorithms, are proposed and evaluated over a wide range of scenarios, considering a realistic smart city application with 100 sensors and up to 10 edge nodes. Both heuristics can return practical solutions for the given application. The results indicate that a suitable topology for a network-bound scenario requires fewer enabled edge nodes than a CPU-bound scenario. The VNS outperformed the GA approach in almost every condition, reaching a performance gain of up to almost 40% when the network delay plays a significant role and when the load is higher. Hence, the experimental tests demonstrate that the proposed heuristics are useful to support the design of edge computing infrastructures for modern AI-based applications relying on data collected by geographically distributed IoT sensors.


2022 - Microservice Performance in Container- and Function-as-a-Service Architectures [Conference paper]
Canali, C.; Lancellotti, R.; Pedroni, P.

Function-as-a-Service (FaaS) is a new cloud-based computing model that promises a more cost-efficient deployment of microservices with respect to other cloud paradigms, like Container-as-a-Service (CaaS). However, requests served under a FaaS approach often experience a cold start condition, which arises when an inactive function is executed for the first time and a container environment has to be set up afresh. In such cases, performance deteriorates and response times increase. This paper proposes an analysis of the performance of the Function-as-a-Service model for two individually offered microservices. Specifically, we carry out a performance evaluation of the Function-as-a-Service model, implemented through OpenWhisk, using as a baseline for comparison the Container-as-a-Service approach, implemented with Docker. Our analysis focuses on metrics related to the response time and to the usage of main server resources such as CPU and memory. For the performance comparison, we exploited two different microservices, based on face recognition and image conversion respectively, in order to evaluate the performance over popular and modern kinds of services included in artificial intelligence and multimedia applications.


2022 - On the impact of stale information on distributed online load balancing protocols for edge computing [Journal article]
Beraldi, R.; Canali, C.; Lancellotti, R.; Mattia, G. P.

The distributed nature of edge computing infrastructures requires a significant effort to avoid overload conditions due to uneven distribution of incoming load from sensors placed over a wide area. While optimisation algorithms operating offline can address this issue in the medium to long term, sudden and unexpected traffic surges require an online approach where load balancing actions are taken at a smaller time scale. However, when the service time of a single request becomes comparable with the latency needed to take and actuate load balancing decisions, the design of online approaches becomes particularly challenging. This paper focuses on the class of online algorithms for load balancing based on resource sharing among random nodes. While this randomisation principle is a straightforward and effective way to share resources and achieve load balance, it fails to work properly when the interval between decision making and decision actuating times (called schedule lag) becomes comparable with the time required to execute a job, a condition that is not rare in edge computing systems and that causes stale (out-of-date) information to be involved in scheduling decisions. Our analysis combines (1) a theoretical model that evaluates how stale information reduces the effectiveness of the balancing mechanism and describes the correlation between the system state at decision making and decision actuating times; (2) a simulation approach to study a wide range of algorithm parameters and possible usage scenarios. The results of our analysis provide the designers of distributed edge systems with useful hints to decide, based on the scenario, which load balancing protocol is the most suitable.
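
The flavour of the problem can be reproduced with a toy simulation, given below under our own simplifying assumptions (it is not the paper's model): a dispatcher picks the shorter of two randomly probed queues, but other arrivals land during the schedule lag, so the decision may be stale by the time the job arrives.

    import random

    def stale_decision_rate(lag_jobs, n_nodes=10, trials=20000, seed=1):
        """Fraction of two-choice decisions that turn out wrong when lag_jobs
        other arrivals occur between probing and job arrival."""
        random.seed(seed)
        queues = [0] * n_nodes
        wrong = 0
        for _ in range(trials):
            a, b = random.sample(range(n_nodes), 2)
            chosen, other = (a, b) if queues[a] <= queues[b] else (b, a)
            for _ in range(lag_jobs):                    # arrivals during the lag
                queues[random.randrange(n_nodes)] += 1
            if queues[chosen] > queues[other]:           # the decision is now stale
                wrong += 1
            queues[chosen] += 1
            queues[:] = [max(0, q - 1) for q in queues]  # service completions
        return wrong / trials

    for lag in (0, 1, 4):
        print(lag, stale_decision_rate(lag))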


2022 - Optimal Placement of Micro-services Chains in a Fog Infrastructure [Conference paper]
Canali, C.; Di Modica, G.; Lancellotti, R.; Scotece, D.

Fog computing emerged as a novel approach to deliver micro-services that support innovative applications. This paradigm is consistent with the modern approach to application development, which leverages the composition of small micro-services that can be combined to create value-added applications. These applications typically require access to distributed data sources, such as sensors located in multiple geographic locations or mobile users. In such scenarios, the traditional cloud approach is not suitable because latency constraints may not be compatible with having time-critical computations occur on a far-away data center; furthermore, the amount of data to exchange may cause high costs imposed by the cloud pricing model. A layer of fog nodes close to application consumers can host pre-processing and data aggregation tasks that can reduce the response time of latency-sensitive elaborations as well as the traffic to the cloud data centers. However, the problem of smartly placing micro-services over fog nodes so as to fulfill Service Level Agreements is far more complex than in the more controlled scenario of cloud computing, due to the heterogeneity of fog infrastructures in terms of performance of both the computing nodes and the inter-node connectivity. In this paper, we tackle this problem by proposing a mathematical model for the performance of complex applications deployed on a fog infrastructure. We adapt the proposed model to be used in a genetic algorithm to achieve optimized deployment decisions about the placement of micro-services chains. Our experiments prove the viability of our proposal with respect to meeting the SLA requirements in a wide set of operating conditions.


2022 - Performance Comparison of Technological Solutions for Spark Applications in AWS [Conference paper]
Lancellotti, R.; Rossi, S.; Miano, G. C.; Miselli, F.

Cloud computing provides a pay-as-you-go infrastructure for the deployment of complex applications, with auto-scaling support and the ability to manage and process huge amounts of data. However, due to the underlying complexity of the cloud infrastructure, it is not trivial to identify the setup providing the best performance in such a scenario. To this aim, the present paper proposes a thorough performance evaluation of a real application on a cloud platform, measuring the impact of several design choices and technological solutions. The experimental results, based on a real application and on realistic data, can provide significant insight that complements the traditional approach of cloud performance evaluation based on synthetic benchmarks.


2021 - A Hierarchical Receding Horizon Algorithm for QoS-driven control of Multi-IaaS Applications [Journal article]
Ardagna, Danilo; Ciavotta, Michele; Lancellotti, Riccardo; Guerriero, Michele

Cloud Computing is emerging as a major trend in the ICT industry. As with any new technology, major challenges lie ahead, one of them concerning resource provisioning. Modern Cloud applications deal with a dynamic context that requires a continuous adaptation process to maintain satisfactory QoS. Unfortunately, current Cloud platforms provide just simple rule-based tools that can be unsuitable in many situations, as they do not prevent SLA violations but only react to them. This situation calls for advanced solutions designed to provide Cloud resources in a predictive and dynamic way. This work presents capacity allocation algorithms whose goal is to minimize the total execution cost while satisfying constraints on the average response time of Cloud based applications. An extensive evaluation of our solution against an Oracle with perfect knowledge of the future and against well-known heuristics presented in the literature is provided. The analysis shows that our solution outperforms the heuristics, producing results very close to the optimal ones and reducing the number of QoS violations. Analytical results are also validated through simulation, which analyses the impact of random perturbations of the Cloud environment. Finally, experiments on a prototype environment demonstrate the effectiveness of our approach under real workloads.
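
The receding-horizon idea itself can be summarized in a few lines. The sketch below is a generic skeleton under our own assumptions, with solve_window() as a hypothetical stand-in for the paper's capacity-allocation step: at each time slot the problem is solved over a prediction window, only the first decision is applied, and the window slides forward.

    import math

    def solve_window(forecast):
        """Hypothetical stand-in for the optimization step: enough VMs per slot
        to keep per-VM load below a target utilization."""
        target = 80.0  # requests/s one VM can absorb within the response-time goal
        return [max(1, math.ceil(demand / target)) for demand in forecast]

    def receding_horizon(workload, window=4):
        applied = []
        for t in range(len(workload) - window + 1):
            plan = solve_window(workload[t:t + window])  # optimize over the window
            applied.append(plan[0])                      # apply only the first step
        return applied

    demand = [120, 200, 350, 300, 180, 90, 60, 220]  # predicted requests/s per slot
    print(receding_horizon(demand))                  # VMs allocated at each slot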


2021 - A Variable Neighborhood Heuristic for Facility Locations in Fog Computing [Conference paper]
Alves de Queiroz, T.; Canali, C.; Iori, M.; Lancellotti, R.

The trend of modern smart city applications towards a continuous increase in the volume of produced data, together with the concurrent need for low and predictable latency in the response time, has motivated the shift from a cloud to a fog computing approach. A fog computing architecture is likely to represent a preferable solution to reduce the application latency and the risk of network congestion by decreasing the volume of data transferred to cloud data centers. However, the design of a fog infrastructure opens new issues concerning not only how to allocate the data flow coming from sensors to fog nodes and from there to cloud data centers, but also the choice of the number and the location of the fog nodes to be activated among a list of potential candidates. We model this facility location issue through a multi-objective optimization problem. We propose a heuristic based on variable neighborhood search, where the neighborhood structures are based on swap and move operations. The proposed method is tested in a wide range of scenarios, considering a realistic setup of a smart city application with geographically distributed sensors. The experimental evaluation shows that our method can achieve stable and better performance compared with other approaches from the literature, supporting the given application.
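
A bare-bones rendering of the search scheme follows, with the two neighborhood structures named in the abstract (swap and move) acting on the set of opened fog nodes; the cost() function here is a toy placeholder, not the paper's multi-objective function.

    import random

    def cost(opened):
        # toy objective: a fixed cost per opened node plus a congestion penalty
        return 10 * len(opened) + (1000 if not opened else 100 / len(opened))

    def move(opened, candidates):  # open or close a single node
        s = set(opened)
        s.symmetric_difference_update({random.choice(candidates)})
        return s

    def swap(opened, candidates):  # exchange an opened node with a closed one
        s = set(opened)
        closed = [c for c in candidates if c not in s]
        if s and closed:
            s.remove(random.choice(sorted(s)))
            s.add(random.choice(closed))
        return s

    def vns(candidates, iters=300, seed=7):
        random.seed(seed)
        best = {candidates[0]}
        for _ in range(iters):
            k, structures = 0, [move, swap]
            while k < len(structures):
                candidate = structures[k](best, candidates)
                if cost(candidate) < cost(best):
                    best, k = candidate, 0  # improvement: restart from 1st structure
                else:
                    k += 1
        return best

    print(sorted(vns(list(range(10)))))  # opened fog nodes for the toy objective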


2021 - Impact of theoretical performance models on the design of fog computing infrastructures [Conference paper]
Canali, C.; Lancellotti, R.; Rossi, S.

The Fog Computing paradigm is increasingly seen as the most promising solution to support Internet of Things applications and satisfy their requirements in terms of response time and Service Level Agreements. For these applications, fog computing offers the great advantage of reducing the response time thanks to a layer of intermediate nodes able to perform pre-processing, filtering and other computational tasks. However, the design of a fog computing infrastructure opens new issues concerning the allocation of data flows coming from sensors over the fog nodes, and the choice of the number of fog nodes to be activated. Many studies rely on a simplified assumption based on an M/M/1 theoretical queuing model to determine the optimal solution for the fog infrastructure design, but such a simplification may result in a mismatch between the predicted and achieved performance of the model. In this paper, we measure this discrepancy in terms of response time and SLA compliance. Furthermore, we explore the impact of non-Poissonian service models and validate our results by means of simulation. Our experiments demonstrate that the use of the M/M/1 model may lead to SLA violations. On the other hand, the use of more sophisticated models for the estimation of the response time can avoid this problem.
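
The gap that the paper measures can be illustrated with the two classical formulas: the M/M/1 mean response time 1/(mu - lambda), and the more general M/G/1 result given by the Pollaczek-Khinchine formula, which grows with the variability of the service time. The numbers below are illustrative.

    def mm1_response(lam, mu):
        return 1.0 / (mu - lam)

    def mg1_response(lam, mean_s, cv2):
        """Pollaczek-Khinchine; cv2 = squared coefficient of variation of service time."""
        rho = lam * mean_s
        assert rho < 1.0
        wait = rho * mean_s * (1.0 + cv2) / (2.0 * (1.0 - rho))
        return mean_s + wait

    lam, mu = 8.0, 10.0                          # jobs/s
    print(mm1_response(lam, mu))                 # 0.5 s
    print(mg1_response(lam, 1 / mu, cv2=1.0))    # exponential service: same 0.5 s
    print(mg1_response(lam, 1 / mu, cv2=4.0))    # high variability: 1.1 s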


2020 - A Location-allocation model for fog computing infrastructures [Conference paper]
De Queiroz, T. A.; Canali, C.; Iori, M.; Lancellotti, R.

The trend of an ever-increasing number of geographically distributed sensors producing data for a plethora of applications, from environmental monitoring to smart cities and autonomous driving, is shifting the computing paradigm from cloud to fog. The increase in the volume of produced data makes the processing and the aggregation of information at a single remote data center unfeasible or too expensive, while latency-critical applications cannot cope with the high network delays of a remote data center. Fog computing is a preferred solution, as latency-sensitive tasks can be moved closer to the sensors. Furthermore, the same fog nodes can perform data aggregation and filtering to reduce the volume of data that is forwarded to the cloud data centers, reducing the risk of network overload. In this paper, we focus on the problem of designing a fog infrastructure, considering how many fog nodes are required, which nodes should be selected from a list of potential candidates, and how to allocate data flows from sensors to fog nodes and from there to cloud data centers. To this aim, we propose and evaluate a formal model based on a multi-objective optimization problem. We thoroughly test our proposal for a wide range of parameters, exploiting a reference scenario setup taken from a realistic smart city application. We compare the performance of our proposal with other approaches to the problem available in the literature, taking into account two objective functions. Our experiments demonstrate that the proposed model is viable for the design of fog infrastructures and can outperform the alternative models, with results that in several cases are close to an ideal solution.


2020 - A Random Walk based Load Balancing Algorithm for Fog Computing [Conference paper]
Beraldi, R.; Canali, C.; Lancellotti, R.; Mattia, G. P.

The growth of large-scale sensing applications (as in the case of smart city applications) is a main driver of the fog computing paradigm. However, as the load on such fog infrastructures increases, there is a growing need for coordination mechanisms that can provide load balancing. The problem is exacerbated by local overload that may occur due to an uneven distribution of processing tasks (jobs) over the infrastructure, which is typical of real applications such as smart cities, where the sensor deployment is irregular and the workload intensity can fluctuate due to rush hours and users' behavior. In this paper we introduce two load sharing mechanisms that aim to offload jobs towards the neighboring nodes. We evaluate the performance of these algorithms in a realistic environment based on a real application for monitoring in a smart city. Our experiments demonstrate that even a simple load balancing scheme is effective in addressing the local hot spots that would arise in a non-collaborative fog infrastructure.


2020 - Adaptive Computing-plus-Communication Optimization Framework for Multimedia Processing in Cloud Systems [Journal article]
Shojafar, Mohammad; Canali, Claudia; Lancellotti, Riccardo; Abawajy, Jemal

A clear trend in the evolution of network-based services is the ever-increasing amount of multimedia data involved. This trend towards big-data multimedia processing finds its natural placement together with the adoption of the cloud computing paradigm, which seems the best solution to cope with the demands of the highly fluctuating workload that characterizes this type of service. However, as cloud data centers become more and more powerful, energy consumption becomes a major challenge both for environmental concerns and for economic reasons. An effective approach to improve energy efficiency in cloud data centers is to rely on traffic engineering techniques to dynamically adapt the number of active servers to the current workload. Towards this aim, we propose a joint computing-plus-communication optimization framework exploiting virtualization technologies, called MMGreen. Our proposal specifically addresses the typical scenario of multimedia data processing with computationally intensive tasks and the exchange of large volumes of data. The proposed framework not only ensures users the Quality of Service (through Service Level Agreements), but also achieves maximum energy saving and attains green cloud computing goals in a fully distributed fashion by utilizing DVFS-based CPU frequency scaling. To evaluate the actual effectiveness of the proposed framework, we conduct experiments with MMGreen under real-world and synthetic workload traces. The results of the experiments show that MMGreen may significantly reduce the energy cost for computing, communication and reconfiguration with respect to previous resource provisioning strategies, while respecting the SLA constraints.
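
To illustrate the DVFS lever that MMGreen exploits, here is a back-of-the-envelope model under textbook assumptions (dynamic CPU power roughly proportional to the cube of the frequency); the constants are invented for illustration, not measured values from the paper.

    def energy_per_job(freq, cycles=2e9, p_idle=20.0, k=1.5e-26):
        """freq in Hz; dynamic power modeled as k * f^3 (a common approximation)."""
        t = cycles / freq                      # execution time of one job
        return (p_idle + k * freq ** 3) * t    # joules consumed by the job

    for f_ghz in (1.0, 2.0, 3.0):
        f = f_ghz * 1e9
        print(f_ghz, "GHz:", round(energy_per_job(f), 1), "J,",
              round(2e9 / f, 2), "s per job")  # slower runs save energy per job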


2020 - Collaboration Strategies for Fog Computing under Heterogeneous Network-bound Scenarios [Conference paper]
Canali, C.; Lancellotti, R.; Mione, S.

The success of IoT applications increases the number of online devices and motivates the adoption of a fog computing paradigm to support large and widely distributed infrastructures. However, the heterogeneity of nodes and their connections requires the introduction of load balancing strategies to guarantee efficient operations. This aspect is particularly critical when some nodes are characterized by high communication delays. Some proposals, such as the Sequential Forwarding algorithm, have been presented in the literature to provide load balancing in fog computing systems. However, such algorithms have not been studied for a wide range of working parameters in a heterogeneous infrastructure; furthermore, these algorithms are not designed to take advantage of the highly heterogeneous network delays that are common in fog infrastructures. The contribution of this study is twofold: first, we evaluate the performance of the sequential forwarding algorithm for several load and delay conditions; second, we propose and test a delay-aware version of the algorithm that takes into account the presence of highly variable node connectivity in the infrastructure. The results of our experiments, carried out using a realistic network topology, demonstrate that a delay-blind approach to sequential forwarding may lead to poor load balancing performance when network delay represents a major contribution to the response time. Furthermore, we show that the delay-aware variant of the algorithm provides a benefit in this case, with a reduction in the response time of up to 6%.
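
The following sketch captures our reading of the two variants compared above: an overloaded node forwards the job to a neighbor, up to a maximum number of hops, and the delay-aware variant picks the unvisited neighbor with the lowest link delay instead of a random one. Topology, loads and delays are hypothetical, and the real algorithm details may differ.

    import random

    def sequential_forwarding(start, load, neighbors, delay,
                              threshold=5, max_hops=3, delay_aware=False):
        node, hops, path_delay = start, 0, 0.0
        visited = {start}
        while load[node] > threshold and hops < max_hops:
            options = [n for n in neighbors[node] if n not in visited] or neighbors[node]
            nxt = (min(options, key=lambda n: delay[(node, n)]) if delay_aware
                   else random.choice(options))
            path_delay += delay[(node, nxt)]
            visited.add(nxt)
            node, hops = nxt, hops + 1
        return node, path_delay  # executing node, accumulated network delay

    load = {"a": 9, "b": 7, "c": 2}  # jobs queued on each fog node
    neighbors = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b"]}
    delay = {(u, v): 0.002 if {u, v} == {"a", "b"} else 0.050
             for u in load for v in load if u != v}
    print(sequential_forwarding("a", load, neighbors, delay, delay_aware=True))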


2020 - Data Flows Mapping in Fog Computing Infrastructures Using Evolutionary Inspired Heuristic [Book chapter]
Canali, C.; Lancellotti, R.

The need for scalable and low-latency architectures that can process large amounts of data from geographically distributed sensors and smart devices is a main driver for the popularity of the fog computing paradigm. A typical scenario to explain the success of fog is a smart city where monitoring applications collect and process a huge amount of data from a plethora of sensing devices located in streets and buildings. The classical cloud paradigm may provide poor scalability, as the amount of data transferred risks congesting the data center links, while the high latency, due to the distance of the data center from the sensors, may create problems for latency-critical applications (such as the support for autonomous driving). A fog node can act as an intermediary in the sensor-to-cloud communications, where pre-processing may be used to reduce the amount of data transferred to the cloud data center and to perform latency-sensitive operations. In this book chapter we address the problem of mapping sensors over the fog nodes with a twofold contribution. First, we introduce a formal model for the mapping problem that aims to minimize response time, considering both network latency and processing time. Second, we present an evolutionary-inspired heuristic (using Genetic Algorithms) for a fast and accurate resolution of this problem. A thorough experimental evaluation, based on a realistic scenario, provides insight into the nature of the problem, confirms the viability of GAs to solve the problem, and evaluates the sensitivity of the heuristic with respect to its main parameters.


2020 - Distributed load balancing for heterogeneous fog computing infrastructures in smart cities [Journal article]
Beraldi, R.; Canali, C.; Lancellotti, R.; Mattia, G. P.

Smart cities represent an archetypal example of infrastructures where the fog computing paradigm can express its potential: we have a large set of sensors deployed over a large geographic area where data should be pre-processed (e.g., to extract relevant information or to filter and aggregate data) before sending the result to a collector that may be a cloud data center, where relevant data are further processed and stored. However, during its lifetime the infrastructure may change, e.g., due to the deployment of additional sensors or fog nodes, while the load can grow, e.g., due to additional services based on the collected data. Since nodes are typically deployed in multiple time stages, they may have different computation capacity due to technology improvements. In addition, an uneven distribution of the workload intensity can arise, e.g., due to hot spots for occasional public events or to rush hours and users’ behavior. In simple words, resources and load can vary over time and space. From the resource management point of view, this scenario is clearly challenging. Due to the large scale and variable nature of the resources, classical centralized solutions should in fact be avoided, since they do not scale well and require transferring all data from sensors to a central hub, distorting the very nature of in-situ data processing. In this paper, we address the problem of resource management by proposing two distributed load balancing algorithms, tailored to deal with heterogeneity. We evaluate the performance of these algorithms both in a simplified environment, where we perform several sensitivity analyses with respect to the factors responsible for the infrastructure heterogeneity, and in a realistic smart city scenario. Furthermore, in our study we combine theoretical models and simulation. Our experiments demonstrate the effectiveness of the algorithms under a wide range of heterogeneity, overall providing a remarkable improvement compared to the case of non-cooperating nodes.


2020 - Randomized Load Balancing under Loosely Correlated State Information in Fog Computing [Conference paper]
Beraldi, R.; Canali, C.; Lancellotti, R.; Mattia, G. P.

Fog computing infrastructures must support increasingly complex applications where a large number of sensors send data to intermediate fog nodes for processing. As the load in such applications (as in the case of a smart cities scenario) is subject to significant fluctuations both over time and space, load balancing is a fundamental task. In this paper we study a fully distributed algorithm for load balancing based on random probing of the neighbors' status. A qualifying point of our study is considering the impact of delay during the probe phase and analyzing the impact of stale load information. We propose a theoretical model for the loss of correlation between actual load on a node and stale information arriving to the neighbors. Furthermore, we analyze through simulation the performance of the proposed algorithm considering a wide set of parameters and comparing it with an approach from the literature based on random walks. Our analysis points out under which conditions the proposed algorithm can outperform the alternatives.


2020 - Smart cities in the fog: Clearing the vision of innovative sensing applications [Book chapter]
Bicocchi, N.; Canali, C.; Lancellotti, R.

The new paradigm of smart cities is deeply intertwined with the development of large-scale sensing applications. An ever-growing number of sensors collect data to support decision strategies for the management of city services. Examples of such applications are traffic monitoring, autonomous driving, environmental sensing, and real-time power/resource utilization metering. A traditional cloud-based approach for the deployment of such services is likely to suffer from performance and QoS problems, due to the risk of congestion on the data center outbound links and to the high latency related to the geographic data exchange. An alternative paradigm to mitigate these problems is fog computing, where a layer of intermediate fog nodes is placed between the sensors and the cloud data center to reduce the amount of data exchanged (through aggregation and filtering) and to host latency-critical services. Fog computing opens several new issues for the management and deployment of services, especially if we consider that new applications may be dynamically deployed and that the infrastructure is subject to changes over time (e.g., adding and removing sensors and fog nodes). While this dynamic behavior can be supported by existing technologies such as containers, service orchestration frameworks, and micro-services, the fog paradigm exacerbates the problem of infrastructure and service coordination and management to the point where new solutions must be devised. The critical challenges that should be addressed by future fog infrastructures for smart cities lie in the areas of service management, optimization of the infrastructure and automatic deployment of applications. In the present chapter, we discuss advantages and disadvantages of solutions for the management of smart city sensing applications, considering architectures, optimization models, algorithms for service deployment, and the support for the application life cycle.


2019 - A Deep-learning-based approach to VM behavior Identification in Cloud Systems [Conference paper]
Stefanini, M.; Lancellotti, R.; Baraldi, L.; Calderara, S.


2019 - A fog computing service placement for smart cities based on genetic algorithms [Conference paper]
Canali, C.; Lancellotti, R.

The growing popularity of the Fog Computing paradigm is driven by the increasing availability of large numbers of sensors and smart devices over a geographically distributed area. The scenario of a smart city is a clear example of this trend. As we face an increasing presence of sensors producing a huge volume of data, the classical cloud paradigm, with few powerful data centers that are far away from the data sources, becomes inadequate. There is a need to deploy a highly distributed layer of data processors that filter, aggregate and pre-process the incoming data according to a fog computing paradigm. However, a fog computing architecture must distribute the incoming workload over the fog nodes to minimize communication latency while avoiding overload. In the present paper we tackle this problem in a twofold way. First, we propose a formal model for the problem of mapping the data sources over the fog nodes. The proposed optimization problem considers both the communication latency and the processing time on the fog nodes (which depends on the node load). Second, we propose a heuristic, based on genetic algorithms, to solve the problem in a scalable way. We evaluate our proposal on a geographic testbed that represents a smart city scenario. Our experiments demonstrate that the proposed heuristic can be used for the optimization of the considered scenario. Furthermore, we perform a sensitivity analysis on the main parameters of the heuristic.
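
A compact genetic-algorithm skeleton for this mapping problem is sketched below, under our own toy assumptions: a chromosome assigns each sensor to a fog node, and the fitness combines a synthetic latency matrix with an M/M/1 processing term. It illustrates the approach, not the authors' code.

    import random

    SENSORS, NODES = 12, 3
    rnd = random.Random(3)
    LAT = [[rnd.uniform(0.001, 0.020) for _ in range(NODES)]
           for _ in range(SENSORS)]        # synthetic sensor-to-node latencies
    MU, LAM = 40.0, 2.0                    # node service rate, per-sensor load

    def fitness(chrom):                    # lower is better
        total = sum(LAT[s][n] for s, n in enumerate(chrom))
        for n in range(NODES):
            k = chrom.count(n)             # sensors mapped to node n
            total += float("inf") if k * LAM >= MU else k / (MU - k * LAM)
        return total

    def evolve(pop_size=30, gens=100, pmut=0.1):
        pop = [[rnd.randrange(NODES) for _ in range(SENSORS)] for _ in range(pop_size)]
        for _ in range(gens):
            pop.sort(key=fitness)
            elite = pop[:pop_size // 2]    # keep the better half
            children = []
            while len(elite) + len(children) < pop_size:
                a, b = rnd.sample(elite, 2)
                cut = rnd.randrange(1, SENSORS)
                child = a[:cut] + b[cut:]  # single-point crossover
                if rnd.random() < pmut:
                    child[rnd.randrange(SENSORS)] = rnd.randrange(NODES)
                children.append(child)
            pop = elite + children
        return min(pop, key=fitness)

    best = evolve()
    print(best, round(fitness(best), 4))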


2019 - A Technique to Identify Data Exchange Between Cloud Virtual Machines [Book chapter]
Bicocchi, N.; Canali, C.; Lancellotti, R.

Modern cloud data centers typically exploit management strategies to reduce the overall energy consumption. While most solutions focus on the energy consumption due to computational elements, the optimization of network-related aspects of a data center is becoming more and more important, considering also the advent of the Software-Defined Network paradigm. An enabling step to implement network-aware Virtual Machine (VM) allocation is the knowledge of data exchange patterns: in this way, the pairs of VMs that exchange large amounts of information can be placed on well-connected hosts (or on the same physical host). Unfortunately, in Infrastructure as a Service data centers, detailed knowledge of VM data exchange is seldom available without the deployment of a specialized (and costly) monitoring infrastructure. In this paper, we propose a technique to infer VM communication patterns starting from the input/output network traffic time series of each VM. We discuss both the theoretical aspects of this technique and the design challenges for its implementation. A case study is used to demonstrate the viability of our idea.
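
The core intuition lends itself to a compact example: if VM1 sends data to VM2, the outbound traffic series of VM1 should correlate with the inbound series of VM2. The synthetic series below (our own construction, not the paper's case study) make the point with a plain Pearson correlation.

    import numpy as np

    rng = np.random.default_rng(0)
    base = rng.poisson(100, size=288)            # 5-minute samples over one day
    out_vm1 = base + rng.normal(0, 5, 288)       # VM1 sends traffic...
    in_vm2 = base + rng.normal(0, 5, 288)        # ...that VM2 receives
    in_vm3 = rng.poisson(100, size=288).astype(float)  # VM3 is unrelated

    def corr(a, b):
        return float(np.corrcoef(a, b)[0, 1])

    print("vm1->vm2:", round(corr(out_vm1, in_vm2), 3))  # close to 1
    print("vm1->vm3:", round(corr(out_vm1, in_vm3), 3))  # close to 0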


2019 - AGATE: Adaptive Gray Area-based TEchnique to Cluster Virtual Machines with Similar Behavior [Journal article]
Canali, Claudia; Lancellotti, Riccardo

As cloud computing data centers grow in size and complexity to accommodate an increasing number of virtual machines, the scalability of monitoring and management processes becomes a major challenge. Recent research studies show that automatically clustering virtual machines that are similar in terms of resource usage may address the scalability issues of IaaS clouds. Existing solutions provide high clustering accuracy at the cost of very long observation periods, which are not compatible with dynamic cloud scenarios where VMs may frequently join and leave. We propose a novel technique, namely AGATE (Adaptive Gray Area-based TEchnique), that provides accurate clustering results for a subset of VMs after a very short time. This result is achieved by introducing elements of fuzzy logic into the clustering process to identify the VMs with an undecided clustering assignment (the so-called gray area), which should be monitored for longer periods. To evaluate the performance of the proposed solution, we apply the technique to multiple case studies with real and synthetic workloads. We demonstrate that our solution can correctly identify the behavior of a high percentage of VMs after a few hours of observation, and significantly reduces the data required for monitoring with respect to state-of-the-art solutions.
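
The gray-area idea can be conveyed with a toy decision rule: a VM is assigned to a cluster only when its top membership degree is sufficiently stronger than the runner-up, and is otherwise left in the gray area for further monitoring. The membership values are invented; AGATE derives them from resource-usage behavior.

    def assign_or_gray(memberships, margin=0.2):
        """memberships: dict cluster -> membership degree in [0, 1] for one VM."""
        ranked = sorted(memberships.items(), key=lambda kv: -kv[1])
        (best, m1), (_, m2) = ranked[0], ranked[1]
        return best if m1 - m2 >= margin else None  # None = gray area, keep watching

    print(assign_or_gray({"web": 0.81, "db": 0.12, "batch": 0.07}))  # -> 'web'
    print(assign_or_gray({"web": 0.45, "db": 0.40, "batch": 0.15}))  # -> None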


2019 - GASP: Genetic algorithms for service placement in fog computing systems [Journal article]
Canali, C.; Lancellotti, R.

Fog computing is becoming popular as a solution to support applications based on geographically distributed sensors that produce huge volumes of data to be processed and filtered under response time constraints. In this scenario, typical of a smart city environment, the traditional cloud paradigm with few powerful data centers located far away from the sources of data becomes inadequate. The fog computing paradigm, which provides a distributed infrastructure of nodes placed close to the data sources, represents a better solution to perform filtering, aggregation, and preprocessing of incoming data streams, reducing the experienced latency and increasing the overall scalability. However, many issues still exist regarding the efficient management of a fog computing architecture, such as the distribution of data streams coming from sensors over the fog nodes to minimize the experienced latency. The contribution of this paper is twofold. First, we present an optimization model for the problem of mapping data streams over fog nodes, considering not only the current load of the fog nodes, but also the communication latency between sensors and fog nodes. Second, to address the complexity of the problem, we present a scalable heuristic based on genetic algorithms. We carried out a set of experiments based on a realistic smart city scenario: the results show that the performance of the proposed heuristic is comparable with that achieved through the solution of the optimization problem. Then, we carried out a comparison among different genetic evolution strategies and operators, which identifies uniform crossover as the best option. Finally, we perform a wide sensitivity analysis that shows the stability of the heuristic performance with respect to its main parameters.
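
For reference, here is the uniform crossover operator that the comparison singles out, in minimal form: each gene (the fog node assigned to one data stream) is inherited from either parent with equal probability. The parent chromosomes are arbitrary examples.

    import random

    def uniform_crossover(parent_a, parent_b, rnd=random):
        return [a if rnd.random() < 0.5 else b for a, b in zip(parent_a, parent_b)]

    random.seed(42)
    print(uniform_crossover([0, 0, 1, 2, 1], [2, 1, 1, 0, 0]))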


2019 - PAFFI: Performance Analysis Framework for Fog Infrastructures in realistic scenarios [Conference paper]
Canali, C.; Lancellotti, R.

The growing popularity of applications involving the processing of huge amounts of data and requiring high scalability and low latency represents the main driver for the success of the fog computing paradigm. A set of fog nodes close to the network edge, hosting functions such as data aggregation, filtering or latency-sensitive applications, can avoid the risk of high latency due to geographic data transfer and network link congestion that hinders the viability of the traditional cloud computing paradigm for a class of applications including support for smart city services and autonomous driving. However, the design of fog infrastructures requires novel techniques for system modeling and performance evaluation able to capture a realistic scenario starting from the geographic location of the infrastructure elements. In this paper we propose PAFFI, a framework for the performance analysis of fog infrastructures in realistic scenarios. We describe the main features of the framework and its capability to automatically generate realistic fog topologies, with an optimized mapping between sensors, fog nodes and cloud data centers, whose performance can be evaluated by means of simulation.


2018 - An Approach to Balance Maintenance Costs and Electricity Consumption in Cloud Data Centers [Journal article]
Chiaraviglio, Luca; D'Andreagiovanni, Fabio; Lancellotti, Riccardo; Shojafar, Mohammad; Blefari Melazzi, Nicola; Canali, Claudia

We target the problem of managing the power states of the servers in a Cloud Data Center (CDC) to jointly minimize the electricity consumption and the maintenance costs derived from the variation of power (and consequently of temperature) on the servers' CPUs. In more detail, we consider a set of virtual machines (VMs) and their requirements in terms of CPU and memory across a set of Time Slots (TSs). We then model the consumed electricity by taking into account the VM processing costs on the servers, the costs for transferring data between the VMs, and the costs for migrating the VMs across the servers. In addition, we employ a material-based fatigue model to compute the maintenance costs needed to repair the CPU, as a consequence of the variation over time of the server power states. After detailing the problem formulation, we design an original algorithm, called Maintenance and Electricity Costs Data Center (MECDC), to solve it. Our results, obtained over several representative scenarios from a real CDC, show that MECDC largely outperforms two reference algorithms, which instead target either the load balancing or the energy consumption of the servers.


2018 - An Optimization Model to Reduce Energy Consumption in Software-Defined Data Centers [Conference paper]
Canali, Claudia; Lancellotti, Riccardo; Shojafar, Mohammad

The increasing popularity of Software-Defined Network technologies is shaping the characteristics of present and future data centers. This trend, leading to the advent of Software-Defined Data Centers, will have a major impact on the solutions addressing the issue of reducing energy consumption in cloud systems. As we move towards a scenario where the network is more flexible and supports virtualization and softwarization of its functions, energy management must take into account not just computation requirements but also network-related effects, and must explicitly consider migrations throughout the infrastructure of Virtual Elements (VEs), which can be both Virtual Machines and Virtual Routers. Failing to do so is likely to result in sub-optimal energy management in current cloud data centers, which will be even more evident in future SDDCs. In this chapter, we propose a joint computation-plus-communication model for VE allocation that minimizes energy consumption in a cloud data center. The model contains a threefold contribution. First, we consider the data exchanged between VEs and we capture the different connections within the data center network. Second, we model the energy consumption due to VE migrations considering both data transfer and computational overhead. Third, we propose a VE allocation process that does not need to introduce and tune weight parameters to combine the two (often conflicting) goals of minimizing the number of powered-on servers and of avoiding too many VE migrations. A case study is presented to validate our proposal. We apply our model considering both computation and communication energy contributions even in the migration process, and we demonstrate that our proposal outperforms the existing alternatives for VE allocation in terms of energy reduction.
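
The shape of such a joint objective can be sketched in a few lines; the function below sums server power, inter-server traffic and one-off migration penalties per time slot. All coefficients and the placement data are invented for illustration and do not come from the chapter.

    def total_energy(placement, traffic, prev_placement,
                     p_server=200.0, e_per_gb=0.3, e_migration=50.0, slot_s=300):
        servers = set(placement.values())
        e_compute = len(servers) * p_server * slot_s      # powered-on servers
        e_net = sum(vol * e_per_gb                        # inter-server traffic only
                    for (a, b), vol in traffic.items()
                    if placement[a] != placement[b])
        e_mig = sum(e_migration for ve in placement
                    if placement[ve] != prev_placement.get(ve, placement[ve]))
        return e_compute + e_net + e_mig                  # joules per time slot

    prev = {"vm1": "s1", "vm2": "s2", "vr1": "s2"}
    new = {"vm1": "s1", "vm2": "s1", "vr1": "s1"}         # consolidate on s1
    traffic = {("vm1", "vm2"): 4.0, ("vm2", "vr1"): 1.0}  # GB per slot
    print(total_energy(new, traffic, prev))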


2018 - Designing a private CDN with an off-sourced network infrastructure: model and case study [Conference paper]
Canali, Claudia; Corbelli, Andrea; Lancellotti, Riccardo

Content Delivery Networks (CDNs) for multimedia contents are typically managed by a dedicated company. However, there are cases where an enterprise already investing in a dedicated network infrastructure wants to deploy its own private CDN. This scenario is quite different from traditional CDNs for a twofold reason: first, the workload characteristics; second, the impact that off-sourcing the management of the network infrastructure to a third party has on the available choices for the CDN design. The contribution of this paper is to introduce and discuss the optimization models used to design the private CDN and to validate our models using a case study.


2018 - Joint Minimization of the Energy Costs from Computing, Data Transmission, and Migrations in Cloud Data Centers [Journal article]
Canali, Claudia; Chiaraviglio, Luca; Lancellotti, Riccardo; Shojafar, Mohammad

We propose a novel model, called JCDME, for the allocation of Virtual Elements (VEs), with the goal of minimizing the energy consumption in a Software-Defined Cloud Data Center (SDDC). In more detail, we model the energy consumption by considering the computing costs of the VEs on the physical servers, the costs for migrating VEs across the servers, and the costs for transferring data between VEs. In addition, JCDME introduces a weight parameter to avoid an excessive number of VE migrations. Specifically, we propose three different strategies to solve the JCDME problem with an automatic and adaptive computation of the weight parameter for the VE migration costs. We then evaluate the considered strategies over a set of scenarios, ranging from a small-sized SDDC up to a medium-sized SDDC composed of hundreds of VEs and hundreds of servers. Our results demonstrate that JCDME is able to save up to an additional 7% of energy w.r.t. previous energy-aware algorithms, without a substantial increase in the solution complexity.


2018 - On private CDNs with off-sourced network infrastructures: A model and a case study [Journal article]
Canali, Claudia; Corbelli, Andrea; Lancellotti, Riccardo

The delivery of multimedia contents through a Content Delivery Network (CDN) is typically handled by a specific third party, separate from the content provider. However, in some specific cases, the content provider may be interested in carrying out this function using a private CDN, possibly over an off-sourced network infrastructure. This scenario poses new challenges and limitations with respect to the typical case of content delivery. First, the system has to face a different workload, as the content consumers are typically part of the same organization as the content provider. Second, the off-sourced nature of the network infrastructure has a major impact on the available choices for CDN design. In this paper we develop an exact mathematical model for the design of a private CDN addressing the issues and the constraints typical of such a scenario. Furthermore, we analyze different heuristics to solve the optimization problem. We apply the proposed model to a real case study and validate the results by means of simulation.


2018 - Special issue on algorithms for the resource management of large scale infrastructures [Journal article]
Ardagna, Danilo; Canali, Claudia; Lancellotti, Riccardo

Modern distributed systems are becoming increasingly complex as virtualization is being applied at both the levels of computing and networking. Consequently, the resource management of this infrastructure requires innovative and efficient solutions. This issue is further exacerbated by the unpredictable workload of modern applications and the need to limit the global energy consumption. The purpose of this special issue is to present recent advances and emerging solutions to address the challenge of resource management in the context of modern large-scale infrastructures. We believe that the four papers that we selected present an up-to-date view of the emerging trends, and the papers propose innovative solutions to support efficient and self-managing systems that are able to adapt, manage, and cope with changes derived from continually changing workload and application deployment settings, without the need for human supervision.


2017 - A Computation- and Network-Aware Energy Optimization Model for Virtual Machines Allocation [Conference paper]
Canali, Claudia; Lancellotti, Riccardo; Shojafar, Mohammad

Reducing energy consumption in cloud data centers is a complex task, where both computation and network related effects must be taken into account. While existing solutions aim to reduce energy consumption by considering computational and communication contributions separately, limited attention has been devoted to models integrating both parts. We claim that this lack leads to sub-optimal management in current cloud data centers, which will be even more evident in future architectures characterized by Software-Defined Network approaches. In this paper, we propose a joint computation-plus-communication model for Virtual Machine (VM) allocation that minimizes energy consumption in a cloud data center. The contribution of the proposed model is threefold. First, we take into account the data traffic exchanges between VMs, capturing the heterogeneous connections within the data center network. Second, the energy consumption due to VM migrations is modeled by considering both data transfer and computational overhead. Third, the proposed VM allocation process does not rely on weight parameters to combine the two (often conflicting) goals of tightly packing VMs to minimize the number of powered-on servers and of avoiding an excessive number of VM migrations. An extensive set of experiments confirms that our proposal, which considers both computation and communication energy contributions even in the migration process, outperforms other approaches for VM allocation in terms of energy reduction.


2017 - A Correlation-based Methodology to Infer Communication Patterns between Cloud Virtual Machines [Conference paper]
Canali, Claudia; Lancellotti, Riccardo

The allocation of VMs over the servers of a cloud data center is becoming a critical task to guarantee energy savings and high performance. Only recently have network-aware techniques for VM allocation been proposed. However, a network-aware placement requires the knowledge of data transfer patterns between VMs, so that VMs exchanging significant amounts of information can be placed on low-cost communication paths (e.g., on the same server). This information is not easy to obtain unless a specialized monitoring function is deployed over the data center infrastructure. In this paper, we propose a correlation-based methodology that aims to infer communication patterns starting from the network traffic time series of each VM, without relying on special-purpose monitoring. Our study focuses on the case where a data center hosts a multi-tier application deployed using horizontal replication. This typical case of application deployment makes the identification of VM communications particularly challenging, because the traffic patterns are similar in every VM belonging to the same application tier. In the evaluation of the proposed methodology, we compare different correlation indexes and consider different time granularities for the monitoring of network traffic. Our study demonstrates the feasibility of the proposed approach, which can identify which VMs are interacting with each other even in the challenging scenario considered in our experiments.


2017 - A measurement-based analysis of temperature variations introduced by power management on Commodity HardWare [Conference paper]
Chiaraviglio, Luca; Blefari-Melazzi, Nicola; Canali, Claudia; Cuomo, Francesca; Lancellotti, Riccardo; Shojafar, Mohammad

Commodity HardWare (CHW) is currently used in the Internet to deploy large data centers or small computing nodes. Moreover, CHW will be also used to deploy future telecommunication networks, thanks to the adoption of the forthcoming network softwarization paradigm. In this context, CHW machines can be put in Active Mode (AM) or in Sleep Mode (SM) several times per day, based on the traffic requirements from users. However, the transitions between the power states may introduce fatigue effects, which may increase the CHW maintenance costs. In this paper, we perform a measurement campaign of a CHW machine subject to power state changes introduced by SM. Our results show that the temperature change due to power state transitions is not negligible, and that the abrupt stopping of the fans on hot components (such as the CPU) tends to spread the heat over the other components of the CHW machine. In addition, we also show that the CHW failure rate is reduced by a factor of 5 when the number of transitions between AM and SM states is more than 20 per day and the SM duration is around 800 [s].


2017 - Identifying Communication Patterns between Virtual Machines in Software-Defined Data Centers [Conference paper]
Canali, Claudia; Lancellotti, Riccardo

Modern cloud data centers typically exploit management strategies to reduce the overall energy consumption. While most solutions focus on the energy consumption due to computational elements, the advent of the Software-Defined Network paradigm opens the possibility for more complex strategies taking into account the network traffic exchange within the data center. However, a network-aware Virtual Machine (VM) allocation requires the knowledge of data communication patterns, so that VMs exchanging significant amounts of data can be placed on the same physical host or on low-cost communication paths. In Infrastructure as a Service data centers, the information about VM traffic exchange is not easily available unless a specialized monitoring function is deployed over the data center infrastructure. The main contribution of this paper is a methodology to infer VM communication patterns starting from the input/output network traffic time series of each VM, without relying on special-purpose monitoring. Our reference scenario is a software-defined data center hosting a multi-tier application deployed using horizontal replication. The proposed methodology has two main goals to support network-aware VM allocation: first, to identify couples of intensively communicating VMs through correlation-based analysis of the time series; second, to identify VMs belonging to the same vertical stack of a multi-tier application. We evaluate the methodology by comparing different correlation indexes, clustering algorithms and time granularities for monitoring the network traffic. The experimental results demonstrate the capability of the proposed approach to identify interacting VMs, even in a challenging scenario where the traffic patterns are similar in every VM belonging to the same application tier.


2017 - Scalable and automatic virtual machines placement based on behavioral similarities [Journal article]
Canali, Claudia; Lancellotti, Riccardo

The success of the cloud computing paradigm is leading to a significant growth in the size and complexity of cloud data centers. This growth exacerbates the scalability issues of the Virtual Machine (VM) placement problem, which assigns VMs to the physical nodes of the infrastructure. This task can be modeled as a multi-dimensional bin-packing problem, with the goal of minimizing the number of physical servers (for economic and environmental reasons), while ensuring that each VM can access the resources it will require in the near future. Unfortunately, the naïve bin-packing formulation applied to a real data center is not solvable in a reasonable time, because the high number of VMs and physical nodes makes the problem computationally unmanageable. Existing solutions improve scalability at the expense of solution quality, resulting in higher costs and a heavier environmental footprint. The Class-Based Placement technique (CBP) is a novel approach that exploits existing solutions to automatically group VMs showing similar behaviour. The Class-Based technique solves a placement problem that considers only some representative VMs for each class, and that can be replicated as a building block to solve the global VM placement problem. Using real traces, we analyse the performance of our proposal, comparing different alternatives to automatically determine the number of building blocks. Furthermore, we compare our proposal against the existing alternatives and evaluate the results for different workload compositions. We demonstrate that the CBP proposal outperforms existing solutions in terms of scalability and VM placement quality.
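
The building-block idea can be pictured as follows; this is a toy single-resource version using first-fit decreasing, whereas the paper works with a richer multi-dimensional formulation, and all class names and numbers below are made up:

```python
import math

def first_fit_decreasing(demands, capacity):
    """Pack VM demands (single resource, e.g. CPU share) onto identical
    servers with first-fit decreasing; returns the list of bins."""
    bins = []
    for d in sorted(demands, reverse=True):
        for b in bins:
            if sum(b) + d <= capacity:
                b.append(d)
                break
        else:
            bins.append([d])
    return bins

def class_based_placement(class_demand, class_count, block_vms, capacity):
    """Solve placement for one small block of class representatives,
    then replicate the block to cover the whole data center."""
    total = sum(class_count.values())
    block = []
    for cls, demand in class_demand.items():
        reps = max(1, round(block_vms * class_count[cls] / total))
        block.extend([demand] * reps)          # representatives per class
    bins = first_fit_decreasing(block, capacity)
    blocks_needed = math.ceil(total / len(block))
    return blocks_needed * len(bins)           # estimated servers overall

# Toy example: two VM classes placed on servers of capacity 1.0.
print(class_based_placement({"web": 0.3, "db": 0.6},
                            {"web": 600, "db": 400},
                            block_vms=10, capacity=1.0))
```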


2016 - A comparison of techniques to detect similarities in cloud virtual machines [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

Scalability in monitoring and management of cloud data centres may be improved through the clustering of virtual machines (VMs) exhibiting similar behaviour. However, available solutions for automatic VM clustering present some important drawbacks that hinder their applicability to real cloud scenarios. For example, existing solutions show a clear trade-off between the accuracy of the VM clustering and the computational cost of the automatic process; moreover, their performance shows a strong dependence on specific technique parameters. To overcome these issues, we propose a novel approach for VM clustering that uses Mixtures of Gaussians (MoGs) together with the Kullback-Leibler divergence to model similarity between VMs. Furthermore, we provide a thorough experimental evaluation of our proposal and of existing techniques to identify the most suitable solution for different workload scenarios.
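
Since the KL divergence between two Mixtures of Gaussians has no closed form, a Monte Carlo estimate is one plausible way to obtain a similarity score; the sketch below assumes 1-D resource-usage models with invented parameters and is not the paper's actual procedure:

```python
import numpy as np

rng = np.random.default_rng(0)

def mog_pdf(x, weights, means, stds):
    """Density of a 1-D Mixture of Gaussians evaluated at points x."""
    x = np.asarray(x)[:, None]
    comp = np.exp(-0.5 * ((x - means) / stds) ** 2) / (stds * np.sqrt(2 * np.pi))
    return comp @ weights

def mog_sample(n, weights, means, stds):
    """Draw n samples from a 1-D Mixture of Gaussians."""
    idx = rng.choice(len(weights), size=n, p=weights)
    return rng.normal(means[idx], stds[idx])

def kl_mog(p, q, n=100_000):
    """Monte Carlo estimate of KL(p || q); no closed form exists for MoGs."""
    x = mog_sample(n, *p)
    return np.mean(np.log(mog_pdf(x, *p) / mog_pdf(x, *q)))

# Two hypothetical per-VM CPU-usage models: (weights, means, std devs).
vm_a = (np.array([0.7, 0.3]), np.array([0.2, 0.8]), np.array([0.05, 0.1]))
vm_b = (np.array([0.6, 0.4]), np.array([0.25, 0.75]), np.array([0.05, 0.1]))
# Symmetrised divergence as a similarity score between the two VMs.
print(kl_mog(vm_a, vm_b) + kl_mog(vm_b, vm_a))
```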


2016 - An energy-aware scheduling algorithm in DVFS-Enabled Networked Data Centers [Relazione in Atti di Convegno]
Shojafar, Mohammad; Canali, Claudia; Lancellotti, Riccardo; Abolfazli, Saeid
abstract

In this paper, we propose an adaptive online energy-aware scheduling algorithm that exploits the reconfiguration capability of Virtualized Networked Data Centers (VNetDCs) processing large amounts of data in parallel. To achieve energy efficiency in such intensive computing scenarios, a jointly balanced provisioning and scaling of the networking-plus-computing resources is required. We propose a scheduler that manages both the incoming workload and the VNetDC infrastructure to minimize the communication-plus-computing energy dissipated by processing incoming traffic, under hard real-time constraints on the per-job computing-plus-communication delays. Specifically, our scheduler can distribute the workload among multiple virtual machines (VMs) and can tune the processor frequencies and the network bandwidth. The energy model used in our scheduler is rather sophisticated and also takes into account the internal/external frequency switching energy costs. Our experiments demonstrate that the proposed scheduler guarantees high quality of service to the users while respecting the service level agreements. Furthermore, it attains minimum energy consumption under two real-world operating conditions: a discrete and finite number of CPU frequencies and non-negligible VM reconfiguration costs. Our results confirm that the overall energy savings of the data center can be significantly higher with respect to existing solutions.
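
A toy rendition of an energy model with frequency switching costs; the cubic-power assumption and the quadratic switching penalty are illustrative choices, not the paper's model, and the hard real-time constraints are ignored here:

```python
def job_energy(loads, freqs, k_dyn=1.0, k_switch=0.1):
    """Toy energy model across scheduling slots: dynamic power grows
    cubically with CPU frequency, so with execution time ~ load / f the
    per-slot energy is ~ k_dyn * load * f^2; a penalty proportional to
    the squared frequency jump models the switching cost."""
    assert len(loads) == len(freqs)
    compute = sum(k_dyn * load * f ** 2 for load, f in zip(loads, freqs))
    switching = sum(k_switch * (f2 - f1) ** 2
                    for f1, f2 in zip(freqs, freqs[1:]))
    return compute + switching

# Same work, two frequency plans: aggressive scaling vs. a smoother one.
loads = [2.0, 8.0, 3.0, 1.0]
print(job_energy(loads, [0.5, 2.0, 0.5, 0.5]))   # large frequency jumps
print(job_energy(loads, [1.0, 1.5, 1.0, 1.0]))   # smoother, costs less here
```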


2016 - Minimizing computing-plus-communication energy consumptions in virtualized networked data centers [Relazione in Atti di Convegno]
Shojafar, Mohammad; Canali, Claudia; Lancellotti, Riccardo; Baccarelli, Enzo
abstract

In this paper, we propose a dynamic resource provisioning scheduler to maximize the application throughput and minimize the computing-plus-communication energy consumption in virtualized networked data centers. The goal is to maximize the energy efficiency, while meeting hard QoS requirements on processing delay. The resulting optimal resource scheduler is adaptive, and jointly performs: i) admission control of the input traffic offered by the cloud provider; ii) adaptive balanced control and dispatching of the admitted traffic; iii) dynamic reconfiguration and consolidation of the Dynamic Voltage and Frequency Scaling (DVFS)-enabled virtual machines instantiated onto the virtualized data center. The proposed scheduler can manage changes of the workload without requiring estimation and prediction of its future trend. Furthermore, it takes into account the most advanced mechanisms for power reduction in servers, such as DVFS and reduced power states. The performance of the proposed scheduler is numerically tested and compared against that of some state-of-the-art schedulers, under both synthetically generated and measured real-world workload traces. The results confirm the good delay-vs.-energy performance of the proposed scheduler.


2015 - A Class-based Virtual Machine Placement Technique for a Greener Cloud [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

The management of IaaS cloud systems is a challenging task, where a huge number of Virtual Machines (VMs) must be placed over a physical infrastructure with multiple nodes. Economic reasons and the need to reduce the ever-growing carbon footprint of modern data centers require an efficient VM placement that minimizes the number of required physical nodes. Since each VM is considered as a black box with independent characteristics, the placement process presents scalability issues due to the amount of involved data and to the resulting number of constraints in the underlying optimization problem. For large data centers, this excludes the possibility of reaching an optimal allocation. Existing solutions typically exploit heuristics or simplified formulations to solve the allocation problem, at the price of possibly sub-optimal solutions. We introduce a novel placement technique, namely Class-Based, that exploits available solutions to automatically group VMs showing similar behavior. The Class-Based technique solves a placement problem that considers only some representatives for each class, and that can be replicated as a building block to solve the global VM placement problem. Our experiments demonstrate that the proposed technique is a viable solution that can significantly improve the scalability of VM placement in IaaS Cloud systems with respect to existing alternatives.


2015 - A scalable monitor for large systems [Relazione in Atti di Convegno]
Andreolini, M.; Pietri, M.; Tosi, S.; Lancellotti, R.
abstract

Current monitoring solutions are not well suited to large data centers in several respects: lack of scalability, poor representativeness of global state conditions, inability to guarantee persistence in service delivery, and the impossibility of monitoring multi-tenant applications. In this paper, we present a novel monitoring architecture that strives to address these problems. It integrates a hierarchical scheme to monitor the resources in a cluster with a distributed hash table (DHT) to broadcast system state information among different monitors. This architecture strives to obtain high scalability, effectiveness and resilience, as well as the possibility of monitoring services spanning different clusters or even different data centers of the cloud provider. We evaluate the scalability of the proposed architecture through an experimental analysis and we measure the overhead of the DHT-based communication scheme.
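
The DHT-based dissemination can be pictured with a minimal consistent-hash ring, in which each monitor owns a stable slice of the key space without any central coordinator; this is a simplified sketch of the idea, not the architecture's actual protocol:

```python
import hashlib
from bisect import bisect_right

class MonitorRing:
    """Minimal consistent-hash ring: each monitor owns an arc of the key
    space, so state records for a resource are routed to a stable owner."""

    def __init__(self, monitors):
        self.ring = sorted((self._h(m), m) for m in monitors)

    @staticmethod
    def _h(key):
        return int(hashlib.sha1(key.encode()).hexdigest(), 16)

    def owner(self, resource_key):
        keys = [h for h, _ in self.ring]
        i = bisect_right(keys, self._h(resource_key)) % len(self.ring)
        return self.ring[i][1]

ring = MonitorRing(["monitor-1", "monitor-2", "monitor-3"])
for res in ("host42/cpu", "host42/mem", "host99/net"):
    print(res, "->", ring.owner(res))
```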


2015 - Automatic parameter tuning for Class-Based Virtual Machine Placement in cloud infrastructures [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

A critical task in the management of Infrastructure as a Service cloud data centers is the placement of Virtual Machines (VMs) over the infrastructure of physical nodes. However, as the size of data centers grows, finding optimal VM placement solutions becomes challenging. The typical approach is to rely on heuristics that improve VM placement scalability by (partially) discarding information about the VM behavior. An alternative approach providing encouraging results, namely Class-Based Placement (CBP), has been proposed recently. CBP considers VMs divided into classes with similar behavior in terms of resource usage. This technique can obtain high-quality placement because it considers a detailed model of VM behavior on a per-class basis. At the same time, scalability is achieved by considering a small-scale VM placement problem that is replicated as a building block for the whole data center. However, a critical parameter of the CBP technique is the number (and size) of the building blocks to consider. Many small building blocks may reduce the overall VM placement solution quality due to fragmentation of the physical node resources over blocks. On the other hand, few large building blocks may become computationally expensive to handle and may be unsolvable due to the problem complexity. This paper addresses this problem by analyzing the impact of block size on the performance of the VM class-based placement. Furthermore, we propose an algorithm to estimate the best number of blocks. Our proposal is validated through experimental results based on a real cloud computing data center.


2015 - Exploiting Classes of Virtual Machines for Scalable IaaS Cloud Management [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

A major challenge of IaaS cloud data centers is the placement of a huge number of Virtual Machines (VMs) over a physical infrastructure with a high number of nodes. The VM placement process must strive to reduce as much as possible the number of physical nodes, to improve management efficiency, reduce energy consumption and guarantee economical savings. However, since each VM is considered as a black box with independent characteristics, the VM placement task presents scalability issues due to the amount of involved data and to the resulting number of constraints in the underlying optimization problem. For large data centers, this condition often makes it impossible to reach an optimal solution for VM placement. Existing solutions typically exploit heuristics or simplified formulations to solve the placement problem, at the price of possibly sub-optimal solutions. We propose an innovative VM placement technique, namely Class-Based, that takes advantage of existing solutions to automatically group VMs showing similar behavior. The Class-Based technique solves a placement problem that considers only some representatives for each class, and that can be replicated as a building block to solve the global VM placement problem. Our experiments demonstrate that the proposed technique is viable and can significantly improve the scalability of VM placement in IaaS Cloud systems with respect to existing alternatives.


2015 - Parameter tuning for scalable multi-resource server consolidation in cloud systems [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

Infrastructure as a Service cloud providers are increasingly relying on scalable and efficient Virtual Machine (VM) placement as the main solution for reducing unnecessary costs and waste of physical resources. However, the continuous growth of the size of cloud data centers poses scalability challenges in finding optimal placement solutions. The use of heuristics and simplified server consolidation models that partially discard information about the VM behavior represents the typical approach to guarantee scalability, but at the expense of suboptimal placement solutions. A recently proposed alternative approach, namely Class-Based Placement (CBP), divides VMs into classes with similar behavior in terms of resource usage, and addresses scalability by considering a small-scale server consolidation problem that is replicated as a building block for the whole data center. However, the server consolidation model exploited by the CBP technique suffers from two main limitations. First, it considers only one VM resource (CPU) in the consolidation problem. Second, it does not analyze the impact of the number (and size) of the building blocks to consider. Many small building blocks may reduce the overall VM placement solution quality due to fragmentation of the physical server resources over blocks. On the other hand, few large building blocks may become computationally expensive to handle and may be unsolvable due to the problem complexity. This paper extends the CBP server consolidation model to take into account multiple resources. Furthermore, we analyze the impact of block size on the performance of the proposed consolidation model, and we present and compare multiple strategies to estimate the best number of blocks. Our proposal is validated through experimental results based on a real cloud computing data center.


2014 - A Receding Horizon Approach for the Runtime Management of IaaS Cloud Systems [Relazione in Atti di Convegno]
Ardagna, Danilo; Ciavotta, Michele; Lancellotti, Riccardo
abstract

Cloud Computing is emerging as a major trend in the ICT industry. However, as with any new technology, it raises new major challenges, and one of them concerns resource provisioning. Indeed, modern Cloud applications deal with a dynamic context and have to constantly adapt themselves in order to meet Quality of Service (QoS) requirements. This situation calls for advanced solutions designed to dynamically provision cloud resources with the aim of guaranteeing the QoS levels. This work presents a capacity allocation algorithm whose goal is to minimize the total execution cost, while satisfying some constraints on the average response time of Cloud-based applications. We propose a receding horizon control technique, which can be employed to handle multiple classes of requests. We compare our solution with an oracle with perfect knowledge of the future and with a well-known heuristic described in the literature. The experimental results demonstrate that our solution outperforms the existing heuristic, producing results very close to the optimal ones. Furthermore, a sensitivity analysis over two different time scales indicates that finer-grained time scales are more appropriate for spiky workloads, whereas smooth traffic conditions are better handled by coarser-grained time scales. Our analytical results are also validated through simulation, which also shows the impact of random perturbations of the Cloud environment on our solution.
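
The receding horizon principle — plan over a look-ahead window, apply only the first decision, re-plan at the next step — can be sketched as follows; the greedy provisioning rule and the per-step scaling limit stand in for the paper's cost-minimizing optimizer and are assumptions:

```python
import math

def receding_horizon(forecast, cap, start_vms=1, max_step=2, horizon=3):
    """Receding horizon sketch: at each step, look `horizon` steps ahead
    in the demand forecast, pick the next VM count, apply it, and
    re-plan at the following step with fresh data. Look-ahead matters
    because at most `max_step` VMs can be added or removed per step,
    so a ramp toward a coming peak has to start early."""
    vms, applied = start_vms, []
    for t in range(len(forecast)):
        window = forecast[t:t + horizon]
        # smallest count needed now so that every step in the window
        # stays reachable under the per-step scaling limit
        target = max(math.ceil(window[j] / cap) - j * max_step
                     for j in range(len(window)))
        vms += min(max_step, max(-max_step, target - vms))  # first move only
        applied.append(vms)
    return applied

# The spike at t=2 forces scaling to begin already at t=0; a longer
# horizon would let the ramp start even earlier and avoid the shortfall.
print(receding_horizon([120, 300, 900, 850, 200, 150], cap=100))
# -> [3, 5, 7, 9, 7, 5]
```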


2014 - An Adaptive Technique to Model Virtual Machine Behavior for Scalable Cloud Monitoring [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

Supporting the emerging digital society is creating new challenges for cloud computing infrastructures, exacerbating scalability issues regarding the processes of resource monitoring and management in large cloud data centers. Recent research studies show that automatically clustering similar virtual machines running the same software component may improve the scalability of the monitoring process in IaaS cloud systems. However, to avoid misclassifications, the clustering process must take into account long time series (up to weeks) of resource measurements, thus resulting in a mechanism that is slow and not suitable for a cloud computing model where virtual machines may be frequently added or removed from the data center. In this paper, we propose a novel methodology that dynamically adapts the length of the time series necessary to correctly cluster each VM depending on its behavior. This approach supports a clustering process that does not have to wait a long time before making decisions about the VM behavior. The proposed methodology exploits elements of fuzzy logic for the dynamic determination of the time series length. To evaluate the viability of our solution, we apply the methodology to a case study considering different algorithms for VMs clustering. Our results confirm that after just 1 day of monitoring we can cluster up to 80% of the VMs without misclassifications, while for the remaining 20% of the VMs longer observations are needed.


2014 - Balancing Accuracy and Execution Time for Similar Virtual Machines Identification in IaaS Cloud [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

Identification of VMs exhibiting similar behavior can improve scalability in monitoring and management of cloud data centers. Existing solutions for automatic VM clustering may be either very accurate, at the price of a high computational cost, or able to provide fast results with limited accuracy. Furthermore, the performance of most solutions may change significantly depending on the specific values of technique parameters. In this paper, we propose a novel approach to model VM behavior using Mixtures of Gaussians (MoGs) to approximate the probability density function of resource utilization. Moreover, we exploit the Kullback-Leibler divergence to measure the similarity between MoGs. The proposed technique is compared against the state of the art through a set of experiments with data coming from a private cloud data center. Our experiments show that the proposed technique can provide high accuracy with limited computational requirements. Furthermore, we show that the performance of our proposal, unlike that of the existing alternatives, does not depend on any parameter.


2014 - Detecting Similarities in Virtual Machine Behavior for Cloud Monitoring using Smoothed Histograms [Articolo su rivista]
Lancellotti, Riccardo; Canali, Claudia
abstract

The growing size and complexity of cloud systems create scalability issues for resource monitoring and management. While most existing solutions consider each Virtual Machine (VM) as a black box with independent characteristics, we embrace a new perspective where VMs with similar behaviors in terms of resource usage are clustered together. We argue that this new approach has the potential to address scalability issues in cloud monitoring and management. In this paper, we propose a technique to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. This innovative technique models VM behavior exploiting the probability histogram of resource usage, and performs smoothing-based noise reduction and selection of the most relevant information to consider for the clustering process. Through extensive evaluation, we show that our proposal achieves high and stable performance in terms of automatic VM clustering, and can reduce the monitoring requirements of cloud systems.
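
A minimal sketch of the smoothed-histogram idea (bin counts, moving-average smoothing, then a distance between normalized histograms); the L1 distance and the beta-distributed toy traces are assumptions, not the paper's exact measures:

```python
import numpy as np

def smoothed_histogram(samples, bins=20, window=3):
    """Probability histogram of a resource-usage series, smoothed with
    a small moving average to damp sampling noise before comparison."""
    hist, _ = np.histogram(samples, bins=bins, range=(0.0, 1.0))
    kernel = np.ones(window) / window
    smooth = np.convolve(hist, kernel, mode="same")
    return smooth / smooth.sum()

def histogram_distance(h1, h2):
    """Simple L1 distance between two normalized histograms."""
    return np.abs(h1 - h2).sum()

rng = np.random.default_rng(1)
web1 = rng.beta(2, 5, size=1000)      # two VMs with web-like CPU usage
web2 = rng.beta(2, 5, size=1000)
db1 = rng.beta(5, 2, size=1000)       # a VM with a different usage profile
h_web1, h_web2, h_db1 = map(smoothed_histogram, (web1, web2, db1))
print(histogram_distance(h_web1, h_web2))   # small: same behavior class
print(histogram_distance(h_web1, h_db1))    # large: different class
```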


2014 - Exploiting ensemble techniques for automatic virtual machine clustering in cloud systems [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

Cloud computing has recently emerged as a new paradigm to provide computing services through large-scale data centers where customers may run their applications in a virtualized environment. The advantages of the cloud in terms of flexibility and economy encourage many enterprises to migrate from local data centers to cloud platforms, thus contributing to the success of such infrastructures. However, as the size and complexity of cloud infrastructures grow, scalability issues arise in monitoring and management processes. Scalability issues are exacerbated because available solutions typically consider each virtual machine (VM) as a black box with independent characteristics, which is monitored at a fine-grained level for management purposes, thus generating huge amounts of data to handle. We claim that scalability issues can be addressed by leveraging the similarity between VMs in terms of resource usage patterns. In this paper, we propose an automated methodology to cluster similar VMs starting from their resource usage information, assuming no knowledge of the software executed on them. This is an innovative methodology that combines the Bhattacharyya distance and ensemble techniques to provide a stable evaluation of similarity between the probability distributions of multiple VM resource usage metrics, considering both system- and network-related data. We evaluate the methodology through a set of experiments on data coming from an enterprise data center. We show that our proposal achieves high and stable performance in automatic VMs clustering, with a significant reduction in the amount of data collected, which lightens the monitoring requirements of a cloud data center.


2014 - Improving scalability of cloud monitoring through PCA-based Clustering of Virtual Machines [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

Cloud computing has recently emerged as a leading paradigm to allow customers to run their applications in virtualized large-scale data centers. Existing solutions for monitoring and management of these infrastructures consider virtual machines (VMs) as independent entities with their own characteristics. However, these approaches suffer from scalability issues due to the increasing number of VMs in modern cloud data centers. We claim that scalability issues can be addressed by leveraging the similarity among VMs behavior in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. The innovative contribution of the proposed methodology is the use of the statistical technique known as principal component analysis (PCA) to automatically select the most relevant information to cluster similar VMs. We apply the methodology to two case studies, a virtualized testbed and a real enterprise data center. In both case studies, the automatic data selection based on PCA allows us to achieve high performance, with a percentage of correctly clustered VMs between 80% and 100% even for short time series (1 day) of monitored data. Furthermore, we estimate the potential reduction in the amount of collected data to demonstrate how our proposal may address the scalability issues related to monitoring and management in cloud computing data centers.
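
The PCA-based selection step can be sketched with a plain SVD; the 90% variance threshold and the toy metric set are illustrative assumptions:

```python
import numpy as np

def pca_features(X, var_threshold=0.9):
    """Project VM feature vectors (rows = VMs, columns = resource
    metrics) onto the principal components that together explain
    `var_threshold` of the total variance."""
    Xc = X - X.mean(axis=0)                       # center each metric
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = (S ** 2) / (S ** 2).sum()
    k = int(np.searchsorted(np.cumsum(explained), var_threshold)) + 1
    return Xc @ Vt[:k].T                          # reduced representation

# Toy data: 6 VMs x 4 metrics (e.g. CPU, mem, net-in, net-out averages).
rng = np.random.default_rng(2)
X = rng.random((6, 4))
Z = pca_features(X)
print(Z.shape)   # (6, k): each VM described by k <= 4 components
```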


2013 - Algorithms for Web Service Selection with Static and Dynamic Requirements [Articolo su rivista]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

A main feature of Service-Oriented Architectures is the capability to support the development of new applications through the composition of existing Web services that are offered by different service providers. The runtime selection of which providers may better satisfy the end-user requirements in terms of quality of service remains an open issue in the context of Web services. The selection of the service providers has to satisfy requirements of different nature: requirements may refer to static qualities of the service providers, which do not change over time or change slowly compared to the service invocation time (for example, related to provider reputation), and to dynamic qualities, which may change on a per-invocation basis (typically related to performance, such as the response time). The main contribution of this paper is to propose a family of novel runtime algorithms that select service providers on the basis of requirements involving both static and dynamic qualities, as in a typical Web scenario. We implement the proposed algorithms in a prototype and compare them with the solutions commonly used in service selection, which consider all the service provider qualities as static for the scope of the selection process. Our experiments show that a static management of quality requirements is viable only in the unrealistic case where the workload remains stable over time, and that it leads to very poor performance in variable environments. On the other hand, the combined management of static and dynamic quality requirements allows us to achieve better user-perceived performance over a wide range of scenarios, with the response time of the proposed algorithms reduced by up to 50% with respect to that of static algorithms.


2013 - Automatic virtual machine clustering based on bhattacharyya distance for multi-cloud systems [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

Size and complexity of modern data centers pose scalability issues for the resource monitoring system supporting management operations, such as server consolidation. When we pass from cloud to multi-cloud systems, scalability issues are exacerbated by the need to manage geographically distributed data centers and exchange monitored data across them. While existing solutions typically consider every Virtual Machine (VM) as a black box with independent characteristics, we claim that scalability issues in multi-cloud systems could be addressed by clustering together VMs that show similar behaviors in terms of resource usage. In this paper, we propose an automated methodology to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. This innovative methodology exploits the Bhattacharyya distance to measure the similarity of the probability distributions of VM resources usage, and automatically selects the most relevant resources to consider for the clustering process. The methodology is evaluated through a set of experiments with data from a cloud provider. We show that our proposal achieves high and stable performance in terms of automatic VM clustering. Moreover, we estimate the reduction in the amount of data collected to support system management in the considered scenario, thus showing how the proposed methodology may reduce the monitoring requirements in multi-cloud systems.
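
The Bhattacharyya distance on discrete histograms is straightforward to compute; the histograms below are invented, and the resource-selection and clustering steps that would follow are omitted:

```python
import numpy as np

def bhattacharyya_distance(p, q, eps=1e-12):
    """Bhattacharyya distance between two discrete probability
    distributions (e.g. normalized histograms of a VM resource usage)."""
    bc = np.sum(np.sqrt(p * q))            # Bhattacharyya coefficient
    return -np.log(bc + eps)

# Normalized CPU-usage histograms for three hypothetical VMs.
vm1 = np.array([0.5, 0.3, 0.1, 0.1])
vm2 = np.array([0.45, 0.35, 0.1, 0.1])     # similar to vm1
vm3 = np.array([0.05, 0.15, 0.3, 0.5])     # different profile
print(bhattacharyya_distance(vm1, vm2))    # near 0: same-cluster candidates
print(bhattacharyya_distance(vm1, vm3))    # larger: different cluster
```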


2012 - A quantitative methodology based on component analysis to identify key users in social networks [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

Social networks are gaining an increasing popularity on the Internet and are becoming a critical media for business and marketing. Hence, it is important to identify the key users that may play a critical role as sources or targets of content dissemination. Existing approaches rely only on the users' social connections; however, considering a single kind of information does not guarantee satisfactory results for the identification of the key users. On the other hand, considering every possible user attribute is clearly unfeasible due to the huge amount of heterogeneous user information. In this paper, we propose to select and combine a subset of user attributes with the goal of identifying sources and targets for content dissemination in a social network. We develop a quantitative methodology based on principal component analysis. Experiments on the YouTube and Flickr networks demonstrate that our solution outperforms existing solutions by 15%.


2012 - Automated Clustering of Virtual Machines based on Correlation of Resource Usage [Articolo su rivista]
Canali, Claudia; Lancellotti, Riccardo
abstract

The recent growth in demand for modern applications, combined with the shift to the Cloud computing paradigm, has led to the establishment of large-scale cloud data centers. The increasing size of these infrastructures represents a major challenge in terms of monitoring and management of the system resources. Available solutions typically consider every Virtual Machine (VM) as a black box with independent characteristics, and face scalability issues by reducing the number of monitored resource samples, considering in most cases only the average CPU usage sampled at a coarse time granularity. We claim that scalability issues can be addressed by leveraging the similarity between VMs in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs depending on the usage of multiple resources, both system- and network-related, assuming no knowledge of the services executed on them. This is an innovative methodology that exploits the correlation between resource usage patterns to cluster similar VMs together. We evaluate the methodology through a case study with data coming from an enterprise datacenter, and we show that high performance may be achieved in automatic VMs clustering. Furthermore, we estimate the reduction in the amount of data collected, thus showing that our proposal may simplify the monitoring requirements and help administrators make decisions on the resource management of cloud computing datacenters.


2012 - Automated clustering of VMs for scalable cloud monitoring and management [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo
abstract

The size of modern datacenters supporting cloud computing represents a major challenge in terms of monitoring and management of system resources. Available solutions typically consider every Virtual Machine (VM) as a black box with independent characteristics, and face scalability issues by reducing the number of monitored resource samples, considering in most cases only the average CPU utilization of VMs sampled at a very coarse time granularity. We claim that better management without compromising scalability could be achieved by clustering together VMs that show similar behavior in terms of resource utilization. In this paper we propose an automated methodology to cluster VMs depending on the utilization of their resources, assuming no knowledge of the services executed on them. The methodology considers several VM resources, both system- and network-related, and exploits the correlation between the resource demands to cluster together similar VMs. We apply the proposed methodology to a case study with data coming from an enterprise datacenter to evaluate the accuracy of VMs clustering and to estimate the reduction in the amount of data collected. The automatic clustering achieved through our approach may simplify the monitoring requirements and help administrators make decisions on the management of the resources in a cloud computing datacenter.


2011 - Assessing the overhead and scalability of system monitors for large data centers [Relazione in Atti di Convegno]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Current data centers are shifting towards cloud-based architectures as a means to obtain a scalable, cost-effective, robust service platform. In spite of this, the underlying management infrastructure has grown in terms of hardware resources and software complexity, making automated resource monitoring a necessity. There are several infrastructure monitoring tools designed to scale to a very high number of physical nodes. However, these tools either collect performance measures at a low frequency (missing the chance to capture the dynamics of a short-term management task) or are simply not equipped with instrumentation specific to cloud computing and virtualization. In this scenario, monitoring the correctness and efficiency of live migrations can become a nightmare. This situation will only worsen in the future, with the increased service demand due to the spreading of the user base. In this paper, we assess the scalability of a prototype monitoring subsystem for different user scenarios. We also identify all the major bottlenecks and give insight on how to remove them.


2011 - Data Acquisition in Social Networks: Issues and Proposals [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The amount of information that is possible to gather from social networks may be useful to different contexts, ranging from marketing to intelligence. In this paper, we describe the three main techniques for data acquisition in social networks, the conditions under which they can be applied, and the open problems. We then focus on the main issues that crawlers have to address for getting data from social networks, and we propose a novel solution that exploits the cloud computing paradigm for crawling. The proposed crawler is modular by design and relies on a large number of distributed nodes and on the MapReduce framework to speed up the data collection process from large social networks.


2011 - Dynamic request management algorithms for Web-based services in cloud computing [Relazione in Atti di Convegno]
Lancellotti, Riccardo; Andreolini, Mauro; Canali, Claudia; Colajanni, Michele
abstract

Service providers of Web-based services can take advantage of many convenient features of cloud computing infrastructures, but they still have to implement request management algorithms that are able to face sudden peaks of requests. We consider distributed algorithms implemented by front-end servers to dispatch and redirect requests among application servers. Current solutions based on load-blind algorithms, or considering just server load and thresholds, are inadequate to cope with the demand patterns reaching modern Internet application servers. In this paper, we propose and evaluate a request management algorithm, namely Performance Gain Prediction, that combines several pieces of information (server load, computational cost of a request, user session migration and redirection delay) to predict whether the redirection of a request to another server may result in a shorter response time. To the best of our knowledge, no other study combines information about infrastructure status, user request characteristics and redirection overhead for dynamic request management in cloud computing. Our results show that the proposed algorithm is able to reduce the response time with respect to existing request management algorithms operating on the basis of thresholds.
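
The gain-prediction idea can be sketched as a comparison of predicted local and remote response times; the M/M/1-style latency model and all numbers below are assumptions, not the paper's model:

```python
def should_redirect(local_load, remote_load, request_cost,
                    redirection_delay, session_migration_cost=0.0):
    """Toy gain prediction: estimate the response time if the request
    is served locally vs. redirected, and redirect only when the
    predicted remote time (including overheads) is lower."""
    local_time = request_cost / max(1e-9, 1.0 - local_load)
    remote_time = (request_cost / max(1e-9, 1.0 - remote_load)
                   + redirection_delay + session_migration_cost)
    return remote_time < local_time

# Heavily loaded local server: redirection pays off despite its overhead.
print(should_redirect(local_load=0.9, remote_load=0.4,
                      request_cost=0.05, redirection_delay=0.1))   # True
# Lightly loaded local server: the redirection overhead is not worth it.
print(should_redirect(local_load=0.3, remote_load=0.2,
                      request_cost=0.05, redirection_delay=0.1))   # False
```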


2011 - Technological solutions to support Mobile Web 2.0 services [Capitolo/Saggio]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The widespread diffusion and technological improvements of wireless networks and portable devices are facilitating mobile access to the Web and Web 2.0 services. The emerging Mobile Web 2.0 scenario still requires appropriate solutions to guarantee user interactions that are comparable with the present levels of service. In this chapter we classify the most important services for Mobile Web 2.0, and we identify the key functions that are required to support each category of Mobile Web 2.0 services. We discuss some possible technological solutions to implement these functions at the client and at the server level, and we identify some research issues that are still open.


2010 - A quantitative methodology to identify relevant users in social networks [Relazione in Atti di Convegno]
Canali, Claudia; Casolari, Sara; Lancellotti, Riccardo
abstract

Social networks are gaining an increasing popularity on the Internet, with tens of millions of registered users and an amount of exchanged content accounting for a large fraction of the Internet traffic. Due to this popularity, social networks are becoming a critical media for business and marketing, as testified by viral advertisement campaigns based on such networks. To exploit the potential of social networks, it is necessary to classify the users in order to identify the most relevant ones. For example, in the context of marketing on social networks, it is necessary to identify which users should be involved in an advertisement campaign. However, the complexity of social networks, where each user is described by a large number of attributes, turns the identification of relevant users into a needle-in-a-haystack problem. Starting from a set of user attributes that may be redundant or may not provide significant information for our analysis, we need to extract a limited number of meaningful characteristics that can be used to identify relevant users. We propose a quantitative methodology based on Principal Component Analysis (PCA) to analyze attributes and extract characteristics of social network users from the initial attribute set. The proposed methodology can be applied to identify relevant users in social networks for different types of analysis. As an application, we present two case studies that show how the proposed methodology can be used to identify relevant users for marketing on the popular YouTube network. Specifically, we identify which users may play a key role in the content dissemination and how users may be affected by different dissemination strategies.


2010 - A two-level distributed architecture for the support of content adaptation and delivery services [Articolo su rivista]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The growing demand for Web and multimedia content accessed through heterogeneous devices requires the providers to tailor resources to the device capabilities on-the-fly. Providing services for content adaptation and delivery opens two novel challenges for present and future content provider architectures: content adaptation services are computationally expensive, and the global storage requirements increase because multiple versions of the same resource may be generated for different client devices. We propose a novel two-level distributed architecture for the support of efficient content adaptation and delivery services. The nodes of the architecture are organized in two levels: thin edge nodes on the first level act as simple request gateways towards the nodes of the second level; fat interior clusters perform all the other tasks, such as content adaptation, caching and fetching. Several experimental results show that the two-level architecture achieves better performance and scalability than existing flat or non-cooperative architectures.


2010 - Adaptive algorithms for efficient content management in social network services [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

Identifying the set of resources that are expected to receive the majority of requests in the near future, namely the hot set, is at the basis of most content management strategies of any Web-based service. Here we consider social network services, which open interesting novel challenges for the hot set identification. Indeed, social connections among the users and variable user access patterns, with continuous operations of resource upload/download, determine a highly variable and dynamic context for the stored resources. We propose adaptive algorithms that combine predictive and social information, and dynamically adjust their parameters according to continuously changing workload characteristics. A large set of experimental results shows that adaptive algorithms can achieve performance close to that of theoretical ideal algorithms and, even more important, they guarantee stable results for a wide range of workload scenarios.


2010 - Characteristics and evolution of content popularity and user relations in social networks [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

Social networks have changed the characteristics of the traditional Web, and these changes are still ongoing. Nowadays, it is impossible to design valid strategies for content management, information dissemination and marketing in the context of a social network system without considering the popularity of its content and the characteristics of the relations among its users. By analyzing two popular social networks and comparing current results with studies dating back to 2007, we confirm some previous results and identify novel trends that can be utilized as a basis for designing appropriate content and system management strategies. Our analyses confirm the growth of the two social networks in terms of quantity of content and number of social links among the users. Social navigation is having an increasing influence on content popularity, because social links represent a primary method through which users search and find content. An interesting novel trend emerging from our study is that subsets of users have a greater impact on the content popularity than observed in previous analyses, with evident consequences on the possibility of implementing content dissemination strategies, such as viral marketing.


2010 - Resource Management Strategies for the Mobile Web [Articolo su rivista]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The success of the Mobile Web is driven by the combination of novel Web-based services with the diffusion of advanced mobile devices that require personalization, location-awareness and content adaptation. The evolutionary trend of the Mobile Web workload places unprecedented strains on the server infrastructure of the content provider at the level of computational and storage capacity, to the extent that the technological improvements at the server and client level may be insufficient to face some resource requirements of the future Mobile Web scenario. This paper presents a twofold contribution. We identify some performance bottlenecks that can limit the performance of the future Mobile Web, and we propose and evaluate novel resource management strategies. They aim to address computational requirements through a pre-adaptation of the most popular resources, even in the presence of the irregular access patterns and short resource lifespans that will characterize the future Mobile Web. We investigate a large space of alternative workload scenarios. Our analysis allows us to identify when the proposed resource management strategies are able to satisfy the computational requirements of the future Mobile Web, and also some conditions where further research is necessary.


2009 - A flexible and robust lookup algorithm for P2P systems [Relazione in Atti di Convegno]
Andreolini, Mauro; Lancellotti, Riccardo
abstract

One of the most critical operations performed in a P2P system is the lookup of a resource. The main issues to be addressed by lookup algorithms are: (1) support for flexible search criteria (e.g., wildcard or multi-keyword searches), (2) effectiveness, i.e., the ability to identify all the resources that match the search criteria, (3) efficiency, i.e., low overhead, (4) robustness with respect to node failures and churning. Flood-based P2P networks provide flexible lookup facilities and robust performance at the expense of high overhead, while other systems (e.g., DHTs) provide a very efficient lookup mechanism but lack flexibility. In this paper, we propose a novel resource lookup algorithm, namely fuzzy-DHT, that solves this trade-off by introducing flexible and robust lookup criteria based on multiple keywords on top of a distributed hash table algorithm. We demonstrate that the fuzzy-DHT algorithm satisfies all the requirements of P2P lookup systems, combining the flexibility of flood-based mechanisms while preserving high efficiency, effectiveness and robustness.
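
A flexible multi-keyword lookup on top of a DHT can be pictured as follows — a much simplified sketch of the idea, not the fuzzy-DHT algorithm itself; node names and the per-keyword hashing scheme are illustrative:

```python
import hashlib

NODES = [f"node-{i}" for i in range(8)]

def node_for(keyword):
    """Map a single keyword onto a DHT node (plain Chord-style hashing)."""
    h = int(hashlib.sha1(keyword.encode()).hexdigest(), 16)
    return NODES[h % len(NODES)]

def publish(index, resource, keywords):
    """Register a resource under each of its keywords, so that any
    keyword (or subset of keywords) can later locate it."""
    for kw in keywords:
        index.setdefault(node_for(kw), {}).setdefault(kw, set()).add(resource)

def lookup(index, keywords):
    """Multi-keyword search: query the node responsible for each
    keyword and intersect the per-keyword result sets."""
    hits = [index.get(node_for(kw), {}).get(kw, set()) for kw in keywords]
    return set.intersection(*hits) if hits else set()

index = {}
publish(index, "song.mp3", ["music", "rock", "1999"])
publish(index, "talk.avi", ["video", "rock"])
print(lookup(index, ["rock"]))            # {'song.mp3', 'talk.avi'}
print(lookup(index, ["rock", "music"]))   # {'song.mp3'}
```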


2009 - Hot set identification for social network applications [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

Several operations of Web-based applications are optimized with respect to the set of resources that will receive the majority of requests in the near future, namely the hot set. Unfortunately, the existing algorithms for the hot set identification do not work well for the emerging social network applications, which are characterized by quite novel features with respect to the traditional Web: highly interactive user accesses, upload and download operations, short lifespan of the resources, social interactions among the members of the online communities. We propose and evaluate innovative combinations of predictive models and social-aware solutions for the identification of the hot set. Experimental results demonstrate that some of the considered algorithms improve the accuracy of the hot set identification by up to 30% if compared to existing models, and they guarantee stable and robust results even in the context of social network applications characterized by high variability.


2009 - Performance Evolution of Mobile Web-Based Services [Articolo su rivista]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The mobile Web's widespread diffusion opens many interesting design and management issues about server infrastructures that must satisfy present and future client demand. Future mobile Web-based services will have growing computational costs. Even requests for the same Web resource will require services to dynamically generate content that takes into account specific devices, user profiles, and contexts. The authors consider the evolution of the mobile Web workload and trends in server and client devices with the goal of anticipating future bottlenecks and developing management strategies.


2008 - Content Delivery and Management [Capitolo/Saggio]
Canali, Claudia; Cardellini, V.; Colajanni, Michele; Lancellotti, Riccardo
abstract

This chapter explores the issues of content delivery through CDNs, with a special focus on the delivery of dynamically generated and personalized content. We describe the main functions of a modern Web system and we discuss how the delivery performance and scalability can be improved by replicating the functions of a typical multi-tier Web system over the nodes of a CDN. For each solution, we present the state of the art in the research literature, as well as the available industry-standard products adopting the solution. Furthermore, we discuss the pros and cons of each CDN-based replication solution, pointing out the scenarios that provide the best benefits and the cases where it is detrimental to performance.


2008 - Impact of Social Networking Services on Performance and Scalability of Web Server Infrastructures [Relazione in Atti di Convegno]
Canali, Claudia; Lancellotti, Riccardo; Sanchez, J.
abstract

The evolution of Internet is heading towards a new generation of social networking services that are characterized by novel access patterns determined by social interactions among the users and by a growing amount of multimedia content involved in each user interaction. The impact of these novel services on the underlying Web infrastructures is significantly different from traditional Web-based services and has not yet been widely studied. This paper presents a scalability and bottleneck analysis of a Web system supporting social networking services for different scenarios of user interaction patterns, amount of multimedia content and network characteristics. Our study demonstrates that for some social networking services the user interaction patterns may play a fundamental role in the definition of the bottleneck resource and must be considered in the design of systems supporting novel applications.


2008 - Impact of social networking services on the performance and scalability of web server infrastructures [Relazione in Atti di Convegno]
Canali, C.; Garcia, J. D.; Lancellotti, R.
abstract

The last generation of the Web is characterized by social networking services where users exchange a growing amount of multimedia content. The impact of these novel services on the underlying Web infrastructures is significantly different from traditional Web-based services and has not yet been widely studied. This paper presents a scalability and bottleneck analysis of a Web system supporting social networking services for different scenarios of user interaction patterns, amount of multimedia content and network characteristics. Our study demonstrates that for some social networking services the user interaction patterns may play a fundamental role in the definition of the bottleneck resource and must be considered in the design of systems supporting novel services.


2008 - Impact of technology trends on the performance of current and future Web-based systems [Relazione in Atti di Convegno]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Hardware technology continues to improve at a considerable rate. Besides the Moore's law increments of CPU speed, in the last years the capacity of the main memory has been increasing at an even more impressive rate. One of the consequences of a continuous increment of main memory resources is the possibility of designing and implementing memory-embedded Web sites in the near future, where both the static resources and the database information are kept in the main memory of the server machines. In this paper, we evaluate the impact of memory and network technology trends on the performance of e-commerce sites, which continue to be an important reference for Web-based services in terms of complexity of the hardware/software technology and in terms of performance, availability and scalability requirements. However, most of the presented considerations can be easily extended to other Web-based services. We demonstrate through experiments on a real system how the system bottlenecks change depending on the amount of memory that is (or will be) available for storing the information of a Web site, with and without taking into account the effects of a WAN. This analysis allows us to anticipate some indications about the interventions on the hardware/software components that could improve the capacity of present and future Web-based services.


2008 - Resource management strategies for Mobile Web-based services [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The great diffusion of Mobile Web-enabled devices allows the implementation of novel personalization, location and adaptation services that will place unprecedented strains on the server infrastructure of the content provider. This paper has a twofold contribution. First, we analyze the five-year trend of Mobile Web-based applications in terms of workload characteristics of the most popular services and their impact on the server infrastructures. As the technological improvements at the server level in the same period of time are insufficient to face the computational requirements of the future Mobile Web-based services, we propose and evaluate adequate resource management strategies. We demonstrate that pre-adapting a small fraction of the most popular resources can reduce the response time by up to one third, thus facing the increased computational impact of the future Mobile Web services.


2007 - A distributed infrastructure supporting personalized services for the Mobile Web [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

Personalized services are a key feature for the success of the next generation Web that is accessed by heterogeneous and mobile client devices. The need to provide high performance and to preserve user data privacy opens a novel dimension in the design of infrastructures and request dispatching algorithms to support personalized services for the Mobile Web. Performance issues are typically addressed by distributed architectures consisting of multiple nodes. Personalized services that are often based on sensitive user information may introduce constraints on the service location when the nodes of the distributed architecture do not provide the same level of security. In this paper, we propose an infrastructure and related dispatching algorithms that aim to combine performance and privacy requirements. The proposed scheme may efficiently support personalized services for the Mobile Web, especially if compared with existing solutions that separately address performance and privacy issues. Our proposal guarantees that up to 97% of the requests accessing sensitive user information are assigned to the most secure nodes, with limited penalties on the response time.


2007 - A Simulation Framework for Cluster-based Web services [Articolo su rivista]
Poleggi, M. E.; Casalicchio, E.; Lancellotti, Riccardo
abstract

We propose a simulation framework, namely CWebSim, specifically designed for the performance evaluation and capacity planning of cluster-based Web services. A broad variety of Web cluster configurations can be simulated through CWebSim. Its modularity permits the definition of different mechanisms, algorithms, network topologies and hardware resources. Also, two workload input alternatives are possible: a trace-driven mode and a distribution-driven mode that encompasses the most recent results on Web workload characterization. We present two case studies to show how CWebSim can be used to test cache cooperation protocols and Web switch dispatching algorithms.


2007 - Impact of request dispatching granularity in geographically distributed Web systems [Relazione in Atti di Convegno]
Andreolini, Mauro; Canali, Claudia; Lancellotti, Riccardo
abstract

The advent of the mobile Web and the increasing demand for personalized contents raise the need for computationally expensive services, such as dynamic generation and on-the-fly adaptation of contents. Providing these services exacerbates the performance issues that have to be addressed by the underlying Web architecture. When performance issues are addressed through geographically distributed Web systems with a large number of nodes located on the network edge, the dispatching mechanism that distributes requests among the system nodes becomes a critical element. In this paper, we investigate how the granularity of request dispatching may affect the performance of a distributed Web system for personalized contents. Through a real prototype, we compare dispatching mechanisms operating at various levels of granularity for different workload and network scenarios. We demonstrate that the choice of the best granularity for request dispatching strongly depends on the characteristics of the workload in terms of heterogeneity and computational requirements. A coarse-grain dispatching is preferable only when the requests have similar computational requirements. In all other instances of skewed workloads, which we can consider more realistic, a fine-grain dispatching augments the control over the node load and allows the system to achieve better performance.


2006 - A distributed architecture to support infomobility services [Relazione in Atti di Convegno]
Canali, C.; Lancellotti, R.
abstract

The growing popularity of mobile and location-aware devices allows the deployment of infomobility systems that provide access to information and services for the support of user mobility. Current systems for infomobility services assume that most information is already available on the mobile device and the device connectivity is used for receiving critical messages from a central server. However, we argue that the next generation of infomobility services will be characterized by collaboration and interaction among the users, provided through real-time bidirectional communication between the client devices and the infomobility system. In this paper we propose an innovative architecture to support next generation infomobility services providing interaction and collaboration among the mobile users, who can travel by several different means of transportation, ranging from cars and trains to traveling on foot. We discuss the design issues of the architecture, with particular emphasis on scalability, availability and user data privacy, which are critical in a collaborative infomobility scenario.


2006 - Content Adaptation Architectures Based on Squid Proxy Server [Articolo su rivista]
Canali, Claudia; Cardellini, V.; Lancellotti, Riccardo
abstract

The overwhelming popularity of the Internet and the technology advancements have driven the diffusion of many different Web-enabled devices. In such a heterogeneous client environment, efficient content adaptation and delivery services are becoming a major requirement for the new Internet service infrastructure. In this paper we describe intermediary-based architectures that provide adaptation and delivery of Web content to different user terminals. We present the design of a Squid-based prototype that carries out the adaptation of Web images and combines such a functionality with the caching of multiple versions of the same resource. We also investigate how to provide some form of cooperation among the nodes of the intermediary infrastructure, with the goal of evaluating to what extent cooperation in discovering, adapting, and delivering Web resources can improve the user-perceived performance.
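
As an illustration of combining adaptation with multi-version caching, the following sketch keys the cache by (URL, profile); the profile names and the adapt() stub are hypothetical and do not reflect the actual interface of the Squid-based prototype.

```python
# Illustrative sketch: cache multiple adapted versions of one resource, keyed by
# (URL, profile). The adapt() stub and profile names are assumptions, not the
# prototype's interface.
cache = {}

def adapt(body: bytes, profile: str) -> bytes:
    # Placeholder transcoder: e.g., aggressively downscale images for a "pda" profile.
    return body[: len(body) // 2] if profile == "pda" else body

def fetch(url: str, profile: str, origin_get) -> bytes:
    key = (url, profile)
    if key in cache:                    # hit on the already-adapted version
        return cache[key]
    if (url, "original") in cache:      # hit on the original: adapt locally
        version = adapt(cache[(url, "original")], profile)
    else:                               # miss: fetch from the origin, then adapt
        body = origin_get(url)
        cache[(url, "original")] = body
        version = adapt(body, profile)
    cache[key] = version                # keep the adapted version for later hits
    return version

print(len(fetch("http://example.org/pic.jpg", "pda", lambda u: b"x" * 1000)))  # 500
```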


2006 - Distributed architectures for high performance and privacy-aware content generation and delivery [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The increasing heterogeneity of mobile client devices used to access the Web requires run-time adaptations of the Web contents. A significant trend in these content adaptation services is the growing amount of personalization required by users. Personalized services are and will be a key feature for the success of the next generation Web, but they open two critical issues: performance and profile management. Issues related to the performance of adaptation services are typically addressed by highly distributed architectures with a large number of nodes located closer to the users. On the other hand, the management of user profiles must take into account the nature of these data, which may contain sensitive information, such as geographic position, navigation history and personal preferences, that should be kept private. In this paper, we propose a distributed architecture for ubiquitous Web access that provides high performance while addressing the privacy issues related to the management of sensitive user information. The proposed distributed-core architecture splits the adaptation services over multiple nodes distributed over a two-level topology, thus exploiting parallel adaptations to improve the user-perceived performance.


2006 - Distribution of adaptation services for Ubiquitous Web access driven by user profile [Relazione in Atti di Convegno]
Canali, Claudia; Colajanni, Michele; Lancellotti, Riccardo
abstract

The popularity of ubiquitous Web access requires run-time adaptations of the Web contents. A significant trend in these content adaptation services is the growing amount of personalization required by users. Personalized services are and will be a key feature for the success of the ubiquitous Web, but they open two critical issues: performance and profile management. Issues related to the performance of adaptation services are typically addressed by highly distributed architectures with a large number of nodes located closer to the users. On the other hand, the management of user profiles must take into account the nature of these data, which may contain sensitive information, such as geographic position, navigation history and personal preferences, that should be kept private. In this paper, we investigate the impact that correct profile management has on distributed infrastructures that provide content adaptation services for ubiquitous Web access. In particular, we propose and compare two scalable solutions for adaptation services deployed on the nodes of a two-level topology. We study, through real prototypes, the performance and the constraints that characterize the proposed architectures.


2006 - Scalable architectures and services for ubiquitous Web access [Relazione in Atti di Convegno]
Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

N/A


2006 - Web System Reliability and Performance [Capitolo/Saggio]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Modern Web sites provide multiple services that are deployed through complex technologies. The importance and the economic impact of consumer-oriented Web sites introduce significant requirements in terms of performance and reliability. This chapter presents some methods for the design of novel Web sites and for the improvement of existing systems that must satisfy some performance requirements even in the case of unpredictable load variations. The chapter is concluded with a case study that describes the application of the proposed methods to a typical consumer-oriented Web site.


2005 - A Two-level Distributed Architecture for Web Content Adaptation and Delivery [Relazione in Atti di Convegno]
Canali, Claudia; Cardellini, V; Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

The complexity of services provided through the Web is continuously increasing as well as the variety of new devices that are gaining access to the Internet. Tailoring Web and multimedia resources to meet the user and client requirements opens two main novel issues in the research area of content delivery. The working set tends to increase substantially because multiple versions may be generated from the same original resource. Moreover, the content adaptation operations may be computationally expensive. In this paper, we consider third-party infrastructures composed of a geographically distributed system of intermediary and cooperative nodes that provide fast content adaptation and delivery of Web resources. We propose a novel distributed architecture of intermediary nodes which are organized in two levels. The front-end nodes in the first tier are thin edge servers that locate the resources and forward the client requests to the nodes in the second tier. These interior nodes are fat servers that run the most expensive functions such as content adaptation, resource caching and fetching. Through real prototypes we compare the performance of the proposed two-level architecture to that of alternative one-level infrastructures where all nodes are fat peers providing the entire set of functions.
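
A compact sketch of this two-level division of labor, under assumed names; the hash-based mapping is merely one plausible way a thin first-tier node could locate resources, since the abstract does not specify the mechanism.

```python
# Hedged sketch of the two-tier split: thin edge nodes locate and forward, fat
# interior nodes run the expensive functions. All names here are hypothetical.
import hashlib

class FatNode:
    """Second-tier node: caching, fetching and content adaptation."""
    def __init__(self):
        self.cache = {}
    def serve(self, url, profile, origin_get, adapt):
        key = (url, profile)
        if key not in self.cache:       # fetch and adapt only on a cache miss
            self.cache[key] = adapt(origin_get(url), profile)
        return self.cache[key]

interior = [FatNode() for _ in range(3)]

def edge_forward(url, profile, origin_get, adapt):
    """First-tier thin server: map the URL to an interior node and forward."""
    h = int(hashlib.sha1(url.encode()).hexdigest(), 16)
    return interior[h % len(interior)].serve(url, profile, origin_get, adapt)

print(edge_forward("http://example.org/a.jpg", "pda",
                   lambda u: b"payload", lambda body, p: body.upper()))
```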


2005 - Architectures for scalable and flexible Web personalization services [Relazione in Atti di Convegno]
Canali, Claudia; Casolari, Sara; Lancellotti, Riccardo
abstract

The complexity of services provided through the Web is continuously increasing, and issues introduced by both heterogeneous client devices and Web content personalization are becoming a major challenge for the Web. Tailoring Web and multimedia resources to meet the user and client requirements opens two main novel issues in the research area of content delivery. The content adaptation operations may be computationally expensive, requiring high efficiency and scalability in the Web architectures. Moreover, personalization services introduce security and consistency issues for user profile information management. In this paper, we propose a novel distributed architecture, with four variants, for the efficient delivery of personalized services, where the nodes are organized in two levels. We discuss how the architectural choices are affected by security and consistency constraints as well as by the access to privileged information of the content provider. Moreover, we discuss performance trade-offs of the various choices.


2005 - Distributed architectures for Web content adaptation and delivery [Capitolo/Saggio]
Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

N/A


2005 - Distributed Systems to Support Efficient Adaptation for Ubiquitous Web [Relazione in Atti di Convegno]
Canali, Claudia; Casolari, Sara; Lancellotti, Riccardo
abstract

The ubiquitous Web will require many adaptation and personalization services, which will be consumed by an impressive amount of different devices and classes of users. These novel advanced services will stress the content provider platforms in an unprecedented way with respect to the content delivery seen in the last decade. Most services, such as multimedia content manipulation (images, audio and video clips), are computationally expensive and no single server will be able to provide all of them; hence scalable distributed architectures will be the common basis for the delivery platform. Moreover, these platforms must also address novel content management issues that are related to the replication and to the consistency and privacy requirements of user/client information. In this paper we propose two scalable distributed architectures that are based on a two-level topology. We investigate the pros and cons of such architectures from the security, consistency, and performance points of view.


2005 - Hybrid cooperative schemes for scalable and stable performance of Web content delivery [Articolo su rivista]
Lancellotti, Riccardo; Mazzoni, Francesca; Colajanni, Michele
abstract

Systems consisting of multiple edge servers are a popular solution to deal with performance and network resource utilization problems related to the growth of the Web. After a first period of prevalent enthusiasm towards cooperating edge servers, the research community is exploring in a more systematic way the real benefits and limitations of cooperative caching. Hierarchical cooperation has clearly shown its limits. We show that the pure protocols (e.g., directory-based, query-based) applied to a flat cooperation topology do not scale well either. For increasing numbers of cooperating edge servers, the amount of exchanged data necessary for cooperation grows exponentially, or the cache hit rates drop, or both. We propose and evaluate two hybrid cooperation schemes for document discovery and delivery. They are based on a semi-flat architecture that organizes the edge servers in groups and combines directory-based and query-based cooperation protocols. A large set of experimental results confirms that the combination of directory-based and query-based schemes increases the scalability of flat architectures based on pure protocols, guarantees more stable performance and tends to reduce pathologically long response times.
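
The semi-flat combination of the two protocol families can be pictured as follows; the group and directory structures are simplified stand-ins, not the paper's actual schemes.

```python
# Sketch: edge servers form groups; a shared per-group directory answers lookups
# cheaply (directory-based tier), and other groups are queried only on a local
# miss (query-based tier, standing in for ICP-style query/reply). Names are assumptions.
class Group:
    def __init__(self, name):
        self.name = name
        self.directory = {}  # url -> server currently holding the document

def lookup(url, my_group, other_groups):
    owner = my_group.directory.get(url)  # 1) intra-group directory hit (cheap)
    if owner:
        return owner
    for g in other_groups:               # 2) inter-group query, only on local miss
        owner = g.directory.get(url)
        if owner:
            return owner
    return None                          # 3) global miss: fetch from the origin

g1, g2 = Group("g1"), Group("g2")
g2.directory["http://example.org/x"] = "server-7"
print(lookup("http://example.org/x", g1, [g2]))  # -> server-7, found by the query tier
```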


2005 - Impact of memory technology trends on performance of Web systems [Relazione in Atti di Convegno]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Hardware technology continues to improve at a considerable rate. Besides the Moore's law increase of CPU speed, it should be considered that the capacity of main memory has been increasing in recent years at an even more impressive rate. One consequence of a continuous increment of memory resources is that we can design and implement memory-embedded Web sites, where both the static resources and the database information are kept in main memory. In this paper, we evaluate the impact of memory trends on the performance of e-commerce sites, which continue to be an important reference for Internet-based services in terms of complexity of the hardware/software technology and in terms of performance, availability and scalability requirements. However, most results are valid even for other Web-based services. We demonstrate through experiments on a real system how the system bottlenecks change depending on the amount of memory that is (or will be) available for the Web site data. This analysis allows us to anticipate the interventions on the hardware/software components that could improve the capacity of present and future Web systems for content generation and delivery.


2005 - Impact of technology trends on performance of Web-based services [Relazione in Atti di Convegno]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Hardware technology continues to improve at a considerable rate. Besides the Moore's law increase of CPU speed, it should be considered that the capacity of main memory has been increasing in recent years at an even more impressive rate. One consequence of a continuous increment of memory resources is that we can design and implement memory-embedded Web sites, where both the static resources and the database information are kept in main memory. In this paper, we evaluate the impact of memory trends on the performance of e-commerce sites, which continue to be an important reference for Internet-based services in terms of complexity of the hardware/software technology and in terms of performance, availability and scalability requirements. However, most results are valid even for other Web-based services. We demonstrate through experiments on a real system how the system bottlenecks change depending on the amount of memory that is (or will be) available for the Web site data. This analysis allows us to anticipate the interventions on the hardware/software components that could improve the capacity of present and future Web systems for content generation and delivery.


2005 - Performance comparison of distributed architectures for content adaptation and delivery of Web resources [Relazione in Atti di Convegno]
Canali, Claudia; Cardellini, V; Colajanni, Michele; Lancellotti, Riccardo
abstract

The increasing popularity of heterogeneous Web-enabled devices and wired/wireless connections motivates the diffusion of content adaptation services that enrich the traditional Web. Different solutions have been proposed for the deployment of efficient adaptation and delivery services: in this paper we focus on intermediate infrastructures that consist of multiple server nodes. We investigate when it is really convenient to place this distributed infrastructure closer to the clients or to the origin servers, and which is the real gain that can be obtained by node cooperation. We evaluate the system performance through three prototypes that are placed in a WAN-emulated environment and are subject to two types of workload.


2005 - Web system reliability and performance: design and testing methodologies [Capitolo/Saggio]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

Modern Web sites provide multiple services that are deployed through complex technologies. The importance and the economic impact of consumer-oriented Web sites introduce significant requirements in terms of performance and reliability. This chapter presents some methods for the design of novel Web sites and for the improvement of existing systems that must satisfy some performance requirements even in the case of unpredictable load variations. The chapter is concluded with a case study that describes the application of the proposed methods to a typical consumer-oriented Web site.


2004 - Analysis of peer-to-peer systems: workload characterization and effects on traffic cacheability [Relazione in Atti di Convegno]
Andreolini, Mauro; Lancellotti, Riccardo; Yu, P. S.
abstract

Peer-to-peer file sharing networks have emerged as a popular new application in the Internet scenario. In this paper, we provide an analytical model of the resource sizes and of the contents shared at a given node. We also study the composition of the content workload hosted in the Gnutella network over time. Finally, we investigate the negative impact of oversimplified hypotheses (e.g., the use of filenames as resource identifiers) on the potentially achievable hit rate of a file sharing cache. The message coming out of our findings is clear: file sharing traffic can be reduced by using a cache to minimize download time and network usage. The design and tuning of the cache server should take into account the presence of different resources sharing the same name and should consider push-based downloads. Failing to do so can result in reduced effectiveness of the caching mechanism.
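
The pitfall of using filenames as resource identifiers is easy to demonstrate in a few lines of Python; the names and contents below are made up for illustration.

```python
# The filename pitfall in a nutshell: two distinct contents share one name, so a
# filename-keyed cache keeps only one of them, while a hash-keyed cache keeps both.
import hashlib

by_name, by_hash = {}, {}

def store(filename: str, content: bytes):
    by_name[filename] = content                           # naive key: last write wins
    by_hash[hashlib.sha1(content).hexdigest()] = content  # robust key: one entry per content

store("song.mp3", b"version-A")
store("song.mp3", b"version-B")    # same name, different resource
print(len(by_name), len(by_hash))  # 1 2 -> the filename key silently lost a resource
```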


2004 - Evaluating User-perceived Benefits of Content Distribution Networks [Relazione in Atti di Convegno]
Canali, Claudia; Cardellini, V.; Colajanni, Michele; Lancellotti, Riccardo
abstract

Content Distribution Networks (CDNs) are a class of successful content delivery architectures used by the most popular Web sites to enhance their performance. The basic idea is to address Internet bottleneck issues by replicating and caching the content of the customer Web sites and serving it from the edge of the network. In this paper we evaluate to what extent the use of a CDN can improve the user-perceived response time. We consider a large set of scenarios with different network conditions and client connections that have not been examined in previous studies. We found that CDNs can offer a significant performance gain in normal network conditions, but the advantage of using CDNs can be reduced by heavy network traffic. Moreover, if CDN usage is not carefully designed, the achieved speedup can be suboptimal.


2004 - Fine grain performance evaluation of e-commerce sites [Articolo su rivista]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo; Mazzoni, F.
abstract

E-commerce sites are still a reference for the Web technology in terms of complexity and performance requirements, including availability and scalability. In this paper we show that a coarse grain analysis, that is used in most performance studies, may lead to incomplete or false deductions about the behavior of the hardware and software components supporting e-commerce sites. Through a fine grain performance evaluation of a medium size e-commerce site, we find some interesting results that demonstrate the importance of an analysis approach that is carried out at the software function level with the combination of distribution oriented metrics instead of average values.


2004 - Open issues in self-inspection and self-decision mechanisms for supporting complex and heterogeneous information systems [Relazione in Atti di Convegno]
Colajanni, Michele; Andreolini, Mauro; Lancellotti, Riccardo
abstract

Self-* properties seem an inevitable means to manage the increasing complexity of networked information systems. The implementation of these properties implies sophisticated software and decision supports. Most research results have focused on the former aspect, with many proposals for passing from traditional to reflective middleware. In this paper we focus instead on the supports for the run-time decisions that any self-* software should take, independently of the underlying software used to achieve some self-* properties. We evidence the problems of self-inspection and self-decision models and mechanisms that have to operate in real time and in extremely heterogeneous environments. Without an adequate solution to these inspection and decision problems, self-* systems have no chance of real applicability to complex and heterogeneous information systems.


2004 - Peer-to-Peer workload characterization: techniques and open issues [Relazione in Atti di Convegno]
Andreolini, Mauro; Colajanni, Michele; Lancellotti, Riccardo
abstract

The popularity of peer-to-peer file sharing networks has attracted multiple interests even in the research community. In this paper, we focus on the workload characterization of file-sharing systems, which should be at the basis of performance evaluation and investigations for possible improvements. The contribution of this paper is twofold: first, we provide a classification of related studies on file-sharing workloads by distinguishing the main information considered and the mechanisms and tools that have been used for data collection. We also point out open issues in file-sharing workload characterization and suggest novel approaches to workload studies.


2004 - System architectures for Web content adaptation services [Articolo su rivista]
Colajanni, Michele; Lancellotti, Riccardo
abstract

N/A


2003 - A distributed architecture of edge proxy servers for cooperative transcoding [Relazione in Atti di Convegno]
V., Cardellini; Colajanni, Michele; Lancellotti, Riccardo; P., Yu
abstract

The large variety of devices that are gaining access to the Internet requires novel server functionalities to tailor Web content at run-time, namely transcoding. Traditional schemes assign transcoding operations to the Web server or single edge proxies. We propose an alternative architecture consisting of cooperative proxy servers which collaborate in discovering and transcoding multiple versions of Web objects. The transcoding functionality opens an entirely new space of investigation in the research area of cache cooperation, because it transforms the proxy servers from content repositories into pro-active network elements providing computation and adaptive delivery. We investigate and evaluate experimentally different schemes for cooperative discovery of multi-version content and transcoding in the context of a flat topology of edge servers.


2003 - Distributed Cooperation Schemes for Document Lookup in Geographically Dispersed Cache Servers [Relazione in Atti di Convegno]
Lancellotti, Riccardo; Colajanni, Michele; Ciciani, Bruno
abstract

Architectures consisting of multiple cache servers are a popular solution to deal with performance and network resource utilization issues related to the growth of Web requests. Cache cooperation is often carried out through purely hierarchical and flat schemes that suffer from scalability problems when the number of servers increases. We propose, implement and compare the performance of three novel distributed cooperation models based on a two-tier organization of the cache servers. The experimental results show that the proposed architectures are effective in supporting cooperative document lookup and download. They guarantee cache hit rates comparable to those of the most performing protocols with a significant reduction of the cooperation overhead. Moreover, in case of a congested network, they reduce the 90-percentile of the system response time by up to nearly 30% with respect to the best pure cooperation mechanisms.


2003 - Cooperative Architectures and Algorithms for Discovery and Transcoding of Multi-version Contents [Relazione in Atti di Convegno]
Canali, Claudia; Cardellini, V; Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

A clear trend of the Web is that a variety of new consumer devices with diverse computing powers, display capabilities, and wired/wireless network connections is gaining access to the Internet. Tailoring Web content to match the device characteristics requires functionalities for content transformation, namely transcoding, that are typically carried out by the content Web server or by an edge proxy server. In this paper, we explore how to improve the user response time by considering systems of cooperative edge servers which collaborate in discovering, transcoding, and delivering multiple versions of Web objects. The transcoding functionality opens an entirely new space of investigation in the research area of distributed cache cooperation, because it transforms the proxy servers from content repositories along the client-server path into pro-active network elements providing computation and adaptive delivery. We propose and investigate different algorithms for cooperative discovery, delivery, and transcoding in the context of edge servers organized in hierarchical and flat peer-to-peer topologies. We compare the performance of the proposed schemes through ColTrES (Collaborative Transcoder Edge Services), a flexible prototype testbed that implements all considered mechanisms.
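
The shared discovery-then-transcode flow behind these algorithms can be sketched as follows; this is a simplified illustration under assumed names, not ColTrES code.

```python
# Simplified discovery-then-transcode flow among cooperating edge servers: look for
# the exact version first, then for a transcodable original, and only then go to the
# origin. Peer structure and helper callables are assumptions, not ColTrES code.
from types import SimpleNamespace

def discover_and_serve(url, version, peers, origin_get, transcode):
    for p in peers:                          # 1) exact-version discovery
        if (url, version) in p.cache:
            return p.cache[(url, version)]
    for p in peers:                          # 2) useful-version discovery + transcoding
        if (url, "original") in p.cache:
            p.cache[(url, version)] = transcode(p.cache[(url, "original")], version)
            return p.cache[(url, version)]
    body = origin_get(url)                   # 3) global miss: fetch, then transcode
    peers[0].cache[(url, "original")] = body
    peers[0].cache[(url, version)] = transcode(body, version)
    return peers[0].cache[(url, version)]

peers = [SimpleNamespace(cache={}) for _ in range(3)]
peers[1].cache[("http://example.org/v.mpg", "original")] = b"full-quality"
print(discover_and_serve("http://example.org/v.mpg", "low-res", peers,
                         lambda u: b"full-quality", lambda body, v: body[:4]))
```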


2003 - Cooperative TransCaching: A System of Distributed Proxy Servers for Web Content Adaptation [Relazione in Atti di Convegno]
Canali, Claudia; Cardellini, V; Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

The Web is rapidly evolving towards a highly heterogeneous access environment, due to the variety of new devices with diverse capabilities and network interfaces. Hence, there is an increasing demand for solutions that enable the transformation of Web content for adapting and delivering it to diverse destination devices. We investigate different schemes for cooperative proxy caching and transcoding that can be implemented in the existing Web infrastructure and compare their performance through prototypes that extend Squid operations to a heterogeneous client environment.


2003 - Distributed Architectures of Active Proxy Servers for Cooperative Transcoding [Relazione in Atti di Convegno]
Cardellini, V.; Colajanni, Michele; Lancellotti, Riccardo; Yu, P. S.
abstract

The large variety of devices that are gaining access to the Internet requires novel server functionalities to tailor Web content at run-time, namely transcoding. Traditional schemes assign transcoding operations to the Web server or single edge proxies. We propose an alternative architecture consisting of cooperative proxy servers which collaborate in discovering and transcoding multiple versions of Web objects. The transcoding functionality opens an entirely new space of investigation in the research area of cache cooperation, because it transforms the proxy servers from content repositories into pro-active network elements providing computation and adaptive delivery. We investigate and evaluate experimentally different schemes for cooperative discovery of multi-version content and transcoding in the context of a flat topology of edge servers.


2003 - Distributed cooperation schemes for document lookup in multiple cache servers [Relazione in Atti di Convegno]
Lancellotti, Riccardo; B., Ciciani; Colajanni, Michele
abstract

Architectures consisting of multiple cache servers are a popular solution to deal with performance and network resource utilization issues related to the growth of Web requests. Cache cooperation is often carried out through purely hierarchical and flat schemes that suffer from scalability problems when the number of servers increases. We propose, implement and compare the performance of three novel distributed cooperation models based on a two-tier organization of the cache servers. The experimental results show that the proposed architectures are effective in supporting cooperative document lookup and download. They guarantee cache hit rates comparable to those of the most performing protocols with a significant reduction of the cooperation overhead. Moreover, in case of a congested network, they reduce the 90-percentile of the system response time by up to nearly 30% with respect to the best pure cooperation mechanisms.


2003 - Scalability of Cooperative algorithms for distributed architectures of proxy servers [Relazione in Atti di Convegno]
Lancellotti, Riccardo; Mazzoni, Francesca; Colajanni, Michele
abstract

Systems consisting of multiple proxy servers are a popular solution to deal with performance and network resource utilization problems related to the growth of the Web. After a first period of prevalent enthusiasm towards cooperating proxy servers, the research community is exploring in a more systematic way the real benefits and limitations of cooperative caching. Hierarchical cooperation has clearly shown its limits. We study the scalability of traditional protocols (e.g., directory-based, query-based) in flat architectures through different performance metrics and experiments using both synthetic workloads and traces. The synthetic workload is also used for a sensitivity analysis with respect to various parameters, while traces are used for validating our observations in a more realistic scenario. We show that ICP achieves a better hit rate than Cache Digests, but the latter has a much smaller overhead, which makes the choice between the two protocols depend on the provider's interest: if the hit rate is the most important parameter, ICP is the natural choice, whereas if keeping the overhead low matters most, Cache Digests is preferable. In any case, both protocols show scalability problems when applied to a large number of cooperating cache servers.
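
The overhead difference becomes tangible with a toy Bloom filter, the data structure at the core of a Cache Digest; sizes and hash counts below are made up, and this is not Squid's implementation. A digest is exchanged periodically, so checking a peer's cache costs no per-request message, at the price of occasional false positives (and staleness) that erode the effective hit rate.

```python
# Toy Bloom filter standing in for a Cache Digest: a compact, periodically exchanged
# summary of a peer's cache. No false negatives, but false positives (and staleness)
# can erode the hit rate; sizes here are made up, and this is not Squid's code.
import hashlib

class Digest:
    def __init__(self, bits=1024, hashes=3):
        self.bits, self.hashes, self.v = bits, hashes, 0
    def _positions(self, key):
        for i in range(self.hashes):
            yield int(hashlib.sha1(f"{i}:{key}".encode()).hexdigest(), 16) % self.bits
    def add(self, key):
        for pos in self._positions(key):
            self.v |= 1 << pos
    def maybe_has(self, key):
        return all(self.v >> pos & 1 for pos in self._positions(key))

d = Digest()
d.add("http://example.org/a.html")
print(d.maybe_has("http://example.org/a.html"))  # True: the peer has it (or had it)
print(d.maybe_has("http://example.org/b.html"))  # usually False; rarely a false positive
```

ICP, by contrast, sends a query to every peer on each miss and waits for replies, which keeps answers exact but makes the message overhead grow with the number of cooperating servers.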


2002 - A scalable architecture for cooperative Web caching [Relazione in Atti di Convegno]
Lancellotti, Riccardo; Ciciani, Bruno; Colajanni, Michele
abstract

Cooperative Web caching is the most common solution for improving the low cache hit rates achievable by single proxies. However, both purely hierarchical and flat architectures suffer from scalability problems due to cooperation protocol overheads. We present a new cooperative architecture that organizes cache servers in well-connected clusters and implements a novel cooperation model based on a two-tier lookup process. The experimental results carried out on a working prototype show that the proposed architecture is really effective in supporting cooperative Web caching, because it guarantees cache hit rates comparable to those of the most performing architectures and it reduces the cooperation overhead to a small fraction of that of other protocols.