Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Přemysl Čech is active.

Publication


Featured researches published by Přemysl Čech.


Information Systems | 2014

On indexing metric spaces using cut-regions

Jakub Lokoč; Juraj Moško; Přemysl Čech; Tomáš Skopal

After two decades of research, the techniques for efficient similarity search in metric spaces have combined virtually all the available tricks resulting in many structural index designs. As the representative state-of-the-art metric access methods (also called metric indexes) that vary in the usage of filtering rules and in structural designs, we could mention the M-tree, the M-Index and the List of Clusters, to name a few. In this paper, we present the concept of cut-regions that could heavily improve the performance of metric indexes that were originally designed to employ simple ball-regions. We show that the shape of cut-regions is far more compact than that of ball-regions, yet preserving simple and concise representation. We present three re-designed metric indexes originating from the above-mentioned ones but utilizing cut-regions instead of ball-regions. We show that cut-regions can be fully utilized in the index structure, positively affecting not only query processing but also the index construction. In the experiments we show that the re-designed metric indexes significantly outperform their original versions. HighlightsThe new cut-region formalism that is suitable for simplified description of compact metric regions.New cheap dynamic construction techniques for the PM-tree that can compete with expensive strategies of the original PM-tree (e.g., multi-way leaf selection).Adaptation of M-Index and List of Clusters to operate with cut-regions.Thorough experimental evaluation also including comparison with the state-of-the-art MAMs.


pacific asia workshop on intelligence and security informatics | 2016

k-NN Classification of Malware in HTTPS Traffic Using the Metric Space Approach

Jakub Lokoăź; Jan Kohout; Přemysl Čech; Tomáš Skopal; Tomáš Pevný

In this paper, we present detection of malware in HTTPS traffic using k-NN classification. We focus on the metric space approach for approximate k-NN searches over dataset of sparse high-dimensional descriptors of network traffic. We show the classification based on approximate k-NN search using metric index exhibits false positive rate reduced by an order of magnitude when compared to the state of the art method, while keeping the classification fast enough.


similarity search and applications | 2016

Feature Extraction and Malware Detection on Large HTTPS Data Using MapReduce

Přemysl Čech; Jan Kohout; Jakub Lokoč; Tomáš Komárek; Jakub Maroušek; Tomáš Pevný

Secure HTTP network traffic represents a challenging immense data source for machine learning tasks. The tasks usually try to learn and identify infected network nodes, given only limited traffic features available for secure HTTP data. In this paper, we investigate the performance of grid histograms that can be used to aggregate traffic features of network nodes considering just 5-min batches for snapshots. We compare the representation using linear and k-NN classifiers. We also demonstrate that all presented feature extraction and classification tasks can be implemented in a scalable way using the MapReduce approach.


similarity search and applications | 2012

Cut-Region: a compact building block for hierarchical metric indexing

Jakub Lokoč; Přemysl Čech; Jiří Novák; Tomáš Skopal

With the emerging applications dealing with complex multimedia retrieval, such as the multimedia exploration, appropriate indexing structures need to be designed. A formalism for compact metric region description can significantly simplify the design of algorithms for such indexes, thus more complex and efficient metric indexes can be developed. In this paper, we introduce the cut-regions that are suitable for compact metric region description and we discuss their basic operations. To demonstrate the power of cut-regions, we redefine the PM-Tree using the cut-region formalism and, moreover, we use the formalism to describe our new improvements of the PM-Tree construction techniques. We have experimentally evaluated that the improved construction techniques lead to query performance originally obtained just using expensive construction techniques. Also in comparison with other metric and spatial access methods, the revisited PM-Tree proved its benefits.


advances in databases and information systems | 2015

MLES: Multilayer Exploration Structure for Multimedia Exploration

Juraj Moško; Jakub Lokoč; Tomáš Grošup; Přemysl Čech; Tomáš Skopal; Jan Lánský

The traditional content-based retrieval approaches usually use flat querying, where whole multimedia database is searched for a result of some similarity query with a user specified query object. However, there are retrieval scenarios (e.g., multimedia exploration), where users may not have a clear search intents in their minds, they just want to inspect a content of the multimedia collection. In such scenarios, flat querying is not suitable for the first phases of browsing, because it retrieves the most similar objects and does not consider a view on part of a multimedia space from different perspectives. Therefore, we defined a new Multilayer Exploration Structure (MLES), that enables exploration of a multimedia collection in different levels of details. Using the MLES, we formally defined popular exploration operations (zoom-in/out, pan) to enable horizontal and vertical browsing in explored space and we discussed several problems related to the area of multimedia exploration.


conference on multimedia modeling | 2015

A Web Portal for Effective Multi-model Exploration

Tomáš Grošup; Přemysl Čech; Jakub Lokoč; Tomáš Skopal

During last decades, there have emerged various similarity models suitable for specific similarity search tasks. In this paper, we present a web-based portal that combines two popular similarity models (based on feature signatures and SURF descriptors) in order to improve the recall of multimedia exploration. Comparing to single-model approach, we demonstrate in the game-like fashion that a multi-model approach could provide users with more diverse and still relevant results.


similarity search and applications | 2017

Malware Discovery Using Behaviour-Based Exploration of Network Traffic

Jakub Lokoč; Tomáš Grošup; Přemysl Čech; Tomáš Pevný; Tomáš Skopal

We present a demo of behaviour-based similarity retrieval in network traffic data. The underlying framework is intended to support domain experts searching for network nodes (computers) infected by malicious software, especially in cases when single client-server communication does not have to be sufficient to reliably identify the infection. The focus is on interactive browsing enabling dynamic changes of the retrieval model, which is based on a recently proposed statistical description (fingerprint) of a communication between two network hosts and the bag of features approach. The demo/framework provides unique insight into the data and enables annotation of the data and model modifications during the search for more effective identification of infected hosts.


advanced data mining and applications | 2017

Comparing MapReduce-Based k-NN Similarity Joins on Hadoop for High-Dimensional Data

Přemysl Čech; Jakub Maroušek; Jakub Lokoč; Yasin N. Silva; Jeremy Starks

Similarity joins represent a useful operator for data mining, data analysis and data exploration applications. With the exponential growth of data to be analyzed, distributed approaches like MapReduce are required. So far, the state-of-the-art similarity join approaches based on MapReduce mainly focused on the processing of vector data with less than one hundred dimensions. In this paper, we revisit and investigate the performance of different MapReduce-based approximate k-NN similarity join approaches on Apache Hadoop for large volumes of high-dimensional vector data.


Expert Systems With Applications | 2018

Learning communication patterns for malware discovery in HTTPs data

Jan Kohout; Tomáš Komárek; Přemysl Čech; Jan Bodnár; Jakub Lokoč

Abstract Encrypted communication on the Internet using the HTTPs protocol represents a challenging task for network intrusion detection systems. While it significantly helps to preserve users’ privacy, it also limits a detection system’s ability to understand the traffic and effectively identify malicious activities. In this work, we propose a method for modeling and representation of encrypted communication from logs of web communication. The idea is based on introducing communication snapshots of individual users’ activity that model contextual information of the encrypted requests. This helps to compensate the information hidden by the encryption. We then propose statistical descriptors of the communication snapshots that can be consumed by various machine learning algorithms for either supervised or unsupervised analysis of the data. In the experimental evaluation, we show that the presented approach can be used even on a large corpus of network traffic logs as the process of creation of the descriptors can be effectively implemented on a Hadoop cluster.


similarity search and applications | 2015

Evaluating Multilayer Multimedia Exploration

Juraj Moško; Jakub Lokoăź; Tomáš Grošup; Přemysl Čech; Tomáš Skopal; Jan Lánský

Multimedia exploration is an entertaining approach for multimedia retrieval enabling users to interactively browse and navigate through multimedia collections in a content-based way. The multimedia exploration approach extends the traditional query-by-example retrieval scenario to be a more intuitive approach for obtaining a global overview over an explored collection. However, novel exploration scenarios require many user studies demonstrating their benefits. In this paper, we present results of an extensive user study focusing on the comparison of 3-layer Multilayer Exploration Structure MLES structure with standard flat k-NN browsing. The results of the user study show that principles of the MLES lead to better effectiveness of the exploration process, especially when searching for a first object of the searched concept in an unknown collection.

Collaboration


Dive into the Přemysl Čech's collaboration.

Top Co-Authors

Avatar

Jakub Lokoč

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Tomáš Skopal

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Tomáš Grošup

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Juraj Moško

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Tomáš Pevný

Czech Technical University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jakub Lokoăź

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar

Jakub Maroušek

Charles University in Prague

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Jan Lánský

University of Finance and Administration

View shared research outputs
Top Co-Authors

Avatar

Jan Bodnár

Charles University in Prague

View shared research outputs
Researchain Logo
Decentralizing Knowledge