Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Nikos Zacheilas is active.

Publication


Featured researches published by Nikos Zacheilas.


european conference on machine learning | 2016

Intelligent Urban Data Monitoring for Smart Cities

Nikolaos Panagiotou; Nikolas Zygouras; Ioannis Katakis; Dimitrios Gunopulos; Nikos Zacheilas; Ioannis Boutsis; Vana Kalogeraki; Stephen Lynch; Brendan O’Brien

Urban data management is already an essential element of modern cities. The authorities can build on the variety of automatically generated information and develop intelligent services that improve citizens daily life, save environmental resources or aid in coping with emergencies. From a data mining perspective, urban data introduce a lot of challenges. Data volume, velocity and veracity are some obvious obstacles. However, there are even more issues of equal importance like data quality, resilience, privacy and security. In this paper we describe the development of a set of techniques and frameworks that aim at effective and efficient urban data management in real settings. To do this, we collaborated with the city of Dublin and worked on real problems and data. Our solutions were integrated in a system that was evaluated and is currently utilized by the city.


distributed event-based systems | 2017

Maximizing Determinism in Stream Processing Under Latency Constraints

Nikos Zacheilas; Vana Kalogeraki; Vincenzo Gulisano; Marina Papatriantafilou; Philippas Tsigas

The problem of coping with the demands of determinism and meeting latency constraints is challenging in distributed data stream processing systems that have to process high volume data streams that arrive from different unsynchronized input sources. In order to deterministically process the streaming data, they need mechanisms that synchronize the order in which tuples are processed by the operators. On the other hand, achieving real-time response in such a system requires careful tradeoff between determinism and low latency performance. We build on a recently proposed approach to handle data exchange and synchronization in stream processing, namely ScaleGate, which comes with guarantees for determinism and an efficient lock-free implementation, enabling high scalability. Considering the challenge and trade-offs implied by real-time constraints, we propose a system which comprises (a) a novel data structure called Slack-ScaleGate (SSG), along with its algorithmic implementation; SSG enables us to guarantee the deterministic processing of tuples as long as they are able to meet their latency constraints, and (b) a method to dynamically tune the maximum amount of time that a tuple can wait in the SSG data-structure, relaxing the determinism guarantees when needed, in order to satisfy the latency constraints. Our detailed experimental evaluation using a traffic monitoring application deployed in the city of Dublin, illustrates the working and benefits of our approach.


distributed event-based systems | 2018

Probabilistic Management of Late Arrival of Events

Nicolo Rivetti; Nikos Zacheilas; Avigdor Gal; Vana Kalogeraki

In a networked world, events are transmitted from multiple distributed sources into CEP systems, where events are related to one another along multiple dimensions, e.g., temporal and spatial, to create complex events. The big data era brought with it an increase in the scale and frequency of event reporting. Internet of Things adds another layer of complexity with multiple, continuously changing event sources, not all of which are perfectly reliable, often suffering from late arrivals. In this work we propose a probabilistic model to deal with the problem of reduced reliability of event arrival time. We use statistical theories to fit the distributions of inter-generation at the source and network delays per event type. Equipped with these distributions we propose a predictive method for determining whether an event belonging to a window has yet to arrive. Given some user-defined tolerance levels (on quality and timeliness), we propose an algorithm for dynamically determining the amount of time a complex event time-window should remain open. Using a thorough empirical analysis, we compare the proposed algorithm against state-of-the-art mechanisms for delayed arrival of events and show the superiority of our proposed method.


distributed applications and interoperable systems | 2017

DIsCO: DynamIc Data COmpression in Distributed Stream Processing Systems

Nikos Zacheilas; Vana Kalogeraki

Supporting high throughput in Distributed Stream Processing Systems (DSPSs) has been an important goal in recent years. Current works either focus on automatically increasing the system resources whenever the current setup is inadequate or apply load shedding techniques discarding some of the incoming data. However, both approaches have significant shortcomings as they require on the fly application reconfiguration where the application needs to be stopped and re-uploaded in the cluster with the new configurations, and can lead to significant information loss. One approach that has not yet been considered for improving the throughput of DSPSs is exploiting compression algorithms to minimize the communication overhead between components especially in cases where we have large-sized data like live CCTV camera reports. This work is the first that provides a novel framework, built on top of Apache Storm, which enables dynamic compression of incoming streaming data. Our approach uses a profiling algorithm to automatically determine the compression algorithm that should be applied and supports both lossless and lossy compression techniques. Furthermore, we propose a novel algorithm for determining when profiling should be applied. Finally, our detailed experimental evaluation with commonly used stream processing applications, indicates a clear improvement on the applications’ throughput when our proposed techniques are applied.


Eurasip Journal on Embedded Systems | 2017

A Pareto-based scheduler for exploring cost-performance trade-offs for MapReduce workloads

Nikos Zacheilas; Vana Kalogeraki

In recent years, we are observing an increased demand for processing large amounts of data. The MapReduce programming model has been utilized by major computing companies and has been integrated by novel cyber physical systems (CPS) in order to perform large-scale data processing. However, the problem of efficiently scheduling MapReduce workloads in cluster environments, like Amazon’s EC2, can be challenging due to the observed trade-off between the need for performance and the corresponding monetary cost. The problem is exacerbated by the fact that cloud providers tend to charge users based on their I/O operations, increasing dramatically the spending budget. In this paper, we describe our approach for scheduling MapReduce workloads in cluster environments taking into consideration the performance/budget trade-off. Our approach makes the following contributions: (i) we propose a novel Pareto-based scheduler for identifying near-optimal resource allocations for user workloads with respect to performance and monetary cost, and (ii) we develop an automatic configuration of basic tasks’ parameters that allows us to further minimize the user’s spending budget and the jobs’ execution times. Our detailed experimental evaluation using both real and synthetic datasets illustrate that our approach improves the performance of the workloads as much as 50%, compared to its competitors.


european conference on machine learning | 2016

INSIGHT: Dynamic Traffic Management Using Heterogeneous Urban Data

Nikolaos Panagiotou; Nikolas Zygouras; Ioannis Katakis; Dimitrios Gunopulos; Nikos Zacheilas; Ioannis Boutsis; Vana Kalogeraki; Stephen Lynch; Brendan O’Brien; Dermot Kinane; Jakub Marecek; Jia Yuan Yu; Rudi Verago; Elizabeth M. Daly; Nico Piatkowski; Thomas Liebig; Christian Bockermann; Katharina Morik; François Schnitzler; Matthias Weidlich; Avigdor Gal; Shie Mannor; Hendrik Stange; Werner Halft; Gennady L. Andrienko

In this demo we present INSIGHT, a system that provides traffic event detection in Dublin by exploiting Big Data and Crowdsourcing techniques. Our system is able to process and analyze input from multiple heterogeneous urban data sources.


panhellenic conference on informatics | 2015

Dynamic reduce task adjustment for hadoop workloads

Vaggelis Antypas; Nikos Zacheilas; Vana Kalogeraki

In recent years, we observe an increasing demand for systems that are capable of efficiently managing and processing huge amounts of data. Apaches Hadoop, an open-source implementation of Googles MapReduce programming model, has emerged as one of the most popular systems for Big Data processing and is supported by major companies like Facebook, Yahoo! and Amazon. One of the most challenging aspects of executing a Hadoop job, is to configure appropriately the number of reduce tasks. The problem is exacerbated when multiple jobs are executing concurrently competing for the available system resources. Our approach consists of the following components: (i) an algorithm for computing the appropriate number of reduce tasks per job, (ii) the usage of profiler-jobs for gathering information necessary for the reduce task computation and (iii) two different policies for fragmenting the reduce tasks to the available system resources when multiple jobs execute concurrently in the cluster. Our detailed experimental evaluation using traffic monitoring Hadoop jobs on our local cluster, illustrates that our approach is practical and exhibits solid performance.


international conference on autonomic computing | 2014

Real-Time Scheduling of Skewed MapReduce Jobs in Heterogeneous Environments

Nikos Zacheilas; Vana Kalogeraki


extending database technology | 2015

Insights on a Scalable and Dynamic Traffic Management System

Nikolaos Zygouras; Nikos Zacheilas; Vana Kalogeraki; Dermot Kinane; Dimitrios Gunopulos


international conference on big data | 2015

Elastic complex event processing exploiting prediction

Nikos Zacheilas; Vana Kalogeraki; Nikolaos Zygouras; Nikolaos Panagiotou; Dimitrios Gunopulos

Collaboration


Dive into the Nikos Zacheilas's collaboration.

Top Co-Authors

Avatar

Vana Kalogeraki

Athens University of Economics and Business

View shared research outputs
Top Co-Authors

Avatar

Dimitrios Gunopulos

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Nikolaos Panagiotou

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Nikolas Zygouras

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Stathis Maroulis

Athens University of Economics and Business

View shared research outputs
Top Co-Authors

Avatar

Ioannis Boutsis

Athens University of Economics and Business

View shared research outputs
Top Co-Authors

Avatar

Ioannis Katakis

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar

Nikolaos Zygouras

National and Kapodistrian University of Athens

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Researchain Logo
Decentralizing Knowledge