Antonio Espinosa
Autonomous University of Barcelona
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Antonio Espinosa.
international conference on conceptual structures | 2016
D. Hernandez-Juarez; A. Chacn; Antonio Espinosa; D. Vzquez; Juan C. Moure; A.M. Lpez
Dense, robust and real-time computation of depth information from stereo-camera systems is a computationally demanding requirement for robotics, advanced driver assistance systems (ADAS) and autonomous vehicles. Semi-Global Matching (SGM) is a widely used algorithm that propagates consistency constraints along several paths across the image. This work presents a real-time system producing reliable disparity estimation results on the new embedded energy-efficient GPU devices. Our design runs on a Tegra X1 at 42 frames per second (fps) for an image size of 640480, 128 disparity levels, and using 4 path directions for the SGM method.
european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface | 2000
Antonio Espinosa; Tomàs Margalef; Emilio Luque
PVM parallel programming model provides a convenient methodology of creating dynamic master/worker applications. In this paper, we introduce the benefits from the use of KappaPi tool for automatic analysis of master/worker applications. First, by the automatic detection of the master/worker paradigm in the application. And second, by the performance analysis of the application focusing on the performance bottlenecks and the limitations of this master/worker collaboration.
international conference on conceptual structures | 2013
Alejandro Chacón; Juan C. Moure; Antonio Espinosa; Porfidio Hernández
Fast pattern matching is a requirement for many problems, specially for bioinformatics sequence analysis like short read mapping applications. This work presents a variation of the FM-index method, denoted n-step FM-index, that is applied in exact match genome search. We propose an alternative two-dimensional FM-index structure that allows backward-search navigation giving steps of n symbols at a time. The main advantages of this arrangement are the reduction of the computational work, but most importantly, the reduction by n of the chain of dependent data accesses, and the increase in the temporal locality of the data access pattern. This benefit comes at the expense of increasing the total amount of data required for the index. We present an in-depth performance analysis of a multi-core implementation of the algorithm using large references (up to 1.5G). We identify memory latency as the major performance limiter for single-thread execution and memory bandwidth for multi-thread execution. Our proposal provides speedups ranging from 1.4× to 2.4×, when there is no limitation on DRAM capacity. We also analyse the trade-off of compacting the proposed data structure in order to reduce memory capacity requirements, now at the expense of increasing execution time. An extra 33% of DRAM space allows our proposal to improve performance by 1.2×, while doubling DRAM size enables an additional 1.5×. Our proposal of n-step algorithm provides an alternative for pseudo-random memory access algorithms to be redesigned to scale in current and future computer systems.
Scientific Programming | 2002
Eduardo César; Anna Morajko; Tomàs Margalef; Joan Sorribes; Antonio Espinosa; Emilio Luque
Performance analysis and tuning of parallel/distributed applications are very difficult tasks for non-expert programmers. It is necessary to provide tools that automatically carry out these tasks. These can be static tools that carry out the analysis on a post-mortem phase or can tune the application on the fly. Both kind of tools have their target applications. Static automatic analysis tools are suitable for stable application while dynamic tuning tools are more appropriate to applications with dynamic behaviour. In this paper, we describe KappaPi as an example of a static automatic performance analysis tool, and also a general environment based on parallel patterns for developing and dynamically tuning parallel/distributed applications.
Proceedings of the 20th European MPI Users' Group Meeting on | 2013
Aprigio Bezerra; Porfidio Hernández; Antonio Espinosa; Juan C. Moure
We describe the use of non-dedicated clusters by a known group of local applications sharing the computational resources with additional bioinformatics MapReduce applications. We have studied how to effectively use the resources shared by both application types during their execution. In order to keep local application execution times unaffected we consider the configuration of a group of parameters of the Hadoop platform. One of the most relevant aspects to consider is the job scheduling policy. Our aim is to allow that tasks from different jobs that handle the same data blocks are grouped to be run on the same node where the blocks are allocated. Experimental results show that our approach outperforms traditional policies.
european conference on parallel processing | 2000
Antonio Espinosa; Tomàs Margalef; Emilio Luque
In this paper, we describe the use of an automatic performance analysis tool for describing the behaviour of a parallel application. KappaPi tool includes a list of techniques that may help the non-expert users in finding the most important performance problems of their applications. As an example, the tool is used to tune the performance of a parallel simulation of a forest fire propagation model.
Human Mutation | 2016
Steve Laurie; Marcos Fernandez-Callejo; Santiago Marco-Sola; Jean-Rémi Trotta; Jordi Camps; Alejandro Chacón; Antonio Espinosa; Marta Gut; Ivo Gut; Simon Heath; Sergi Beltran
As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next‐generation sequencing as standard practice in research and diagnostics. However, computing cost–performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state‐of‐the‐art read aligners (BWA‐MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.
european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface | 1999
Antonio Espinosa; F. Parcerisa; Tomàs Margalef; Emilio Luque
Traditional parallel programming forces the programmer to understand the enormous amount of performance information obtained from the execution of a program. In this paper, we show how the use of KappaPi automatic analysis tool helps the programmers of applications to avoid this difficult task. In the last stage of the analysis we discuss the possibilities of establishing relationships between the performance information found and the programming structure of the application.
The Journal of Supercomputing | 2012
Antonio Espinosa; Porfidio Hernández; Juan C. Moure; J. Protasio; Ana Ripoll
The map-reduce paradigm has shown to be a simple and feasible way of filtering and analyzing large data sets in cloud and cluster systems. Algorithms designed for the paradigm must implement regular data distribution patterns so that appropriate use of resources is ensured. Good scalability and performance on Map-Reduce applications greatly depend on the design of regular intermediate data generation-consumption patterns at the map and reduce phases. We describe the data distribution patterns found in current Map-Reduce read mapping bioinformatics applications and show some data decomposition principles to greatly improve their scalability and performance
international conference on computational science | 2016
V. Campmany; S. Silva; Antonio Espinosa; Juan C. Moure; D. Vzquez; A.M. Lpez
We propose a real-time pedestrian detection system for the embedded Nvidia Tegra X1 GPU-CPU hybrid platform. The detection pipeline is composed by the following state-of-the-art algorithms: features extracted from the input image are Histograms of Local Binary Patterns (LBP) and Histograms of Oriented Gradients (HOG); candidate generation using Pyramidal Sliding Window technique; and classification with Support Vector Machine (SVM). Experimental results show that the Tegra ARM platform is two times more energy efficient than a desktop GPU and at least 8 times faster than a desktop multicore CPU.