Is this you? Create Your Porfile

Antonio Espinosa

Autonomous University of Barcelona

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Antonio Espinosa is active.

Explore More

Publication

Featured researches published by Antonio Espinosa.

international conference on conceptual structures | 2016

Embedded Real-time Stereo Estimation via Semi-global Matching on the GPU

D. Hernandez-Juarez; A. Chacn; Antonio Espinosa; D. Vzquez; Juan C. Moure; A.M. Lpez

Dense, robust and real-time computation of depth information from stereo-camera systems is a computationally demanding requirement for robotics, advanced driver assistance systems (ADAS) and autonomous vehicles. Semi-Global Matching (SGM) is a widely used algorithm that propagates consistency constraints along several paths across the image. This work presents a real-time system producing reliable disparity estimation results on the new embedded energy-efficient GPU devices. Our design runs on a Tegra X1 at 42 frames per second (fps) for an image size of 640480, 128 disparity levels, and using 4 path directions for the SGM method.

european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface | 2000

Automatic Performance Analysis of Master/Worker PVM Applications with Kpi

Antonio Espinosa; Tomàs Margalef; Emilio Luque

PVM parallel programming model provides a convenient methodology of creating dynamic master/worker applications. In this paper, we introduce the benefits from the use of KappaPi tool for automatic analysis of master/worker applications. First, by the automatic detection of the master/worker paradigm in the application. And second, by the performance analysis of the application focusing on the performance bottlenecks and the limitations of this master/worker collaboration.

international conference on conceptual structures | 2013

n-step FM-Index for Faster Pattern Matching

Alejandro Chacón; Juan C. Moure; Antonio Espinosa; Porfidio Hernández

Fast pattern matching is a requirement for many problems, specially for bioinformatics sequence analysis like short read mapping applications. This work presents a variation of the FM-index method, denoted n-step FM-index, that is applied in exact match genome search. We propose an alternative two-dimensional FM-index structure that allows backward-search navigation giving steps of n symbols at a time. The main advantages of this arrangement are the reduction of the computational work, but most importantly, the reduction by n of the chain of dependent data accesses, and the increase in the temporal locality of the data access pattern. This benefit comes at the expense of increasing the total amount of data required for the index. We present an in-depth performance analysis of a multi-core implementation of the algorithm using large references (up to 1.5G). We identify memory latency as the major performance limiter for single-thread execution and memory bandwidth for multi-thread execution. Our proposal provides speedups ranging from 1.4× to 2.4×, when there is no limitation on DRAM capacity. We also analyse the trade-off of compacting the proposed data structure in order to reduce memory capacity requirements, now at the expense of increasing execution time. An extra 33% of DRAM space allows our proposal to improve performance by 1.2×, while doubling DRAM size enables an additional 1.5×. Our proposal of n-step algorithm provides an alternative for pseudo-random memory access algorithms to be redesigned to scale in current and future computer systems.

Scientific Programming | 2002

Dynamic performance tuning supported by program specification

Eduardo César; Anna Morajko; Tomàs Margalef; Joan Sorribes; Antonio Espinosa; Emilio Luque

Performance analysis and tuning of parallel/distributed applications are very difficult tasks for non-expert programmers. It is necessary to provide tools that automatically carry out these tasks. These can be static tools that carry out the analysis on a post-mortem phase or can tune the application on the fly. Both kind of tools have their target applications. Static automatic analysis tools are suitable for stable application while dynamic tuning tools are more appropriate to applications with dynamic behaviour. In this paper, we describe KappaPi as an example of a static automatic performance analysis tool, and also a general environment based on parallel patterns for developing and dynamically tuning parallel/distributed applications.

Proceedings of the 20th European MPI Users' Group Meeting on | 2013

Job scheduling for optimizing data locality in Hadoop clusters

Aprigio Bezerra; Porfidio Hernández; Antonio Espinosa; Juan C. Moure

We describe the use of non-dedicated clusters by a known group of local applications sharing the computational resources with additional bioinformatics MapReduce applications. We have studied how to effectively use the resources shared by both application types during their execution. In order to keep local application execution times unaffected we consider the configuration of a group of parameters of the Hadoop platform. One of the most relevant aspects to consider is the job scheduling policy. Our aim is to allow that tasks from different jobs that handle the same data blocks are grouped to be run on the same node where the blocks are allocated. Experimental results show that our approach outperforms traditional policies.

european conference on parallel processing | 2000

Integrating Automatic Techniques in a Performance Analysis Session

Antonio Espinosa; Tomàs Margalef; Emilio Luque

In this paper, we describe the use of an automatic performance analysis tool for describing the behaviour of a parallel application. KappaPi tool includes a list of techniques that may help the non-expert users in finding the most important performance problems of their applications. As an example, the tool is used to tune the performance of a parallel simulation of a forest fire propagation model.

Human Mutation | 2016

From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.

Steve Laurie; Marcos Fernandez-Callejo; Santiago Marco-Sola; Jean-Rémi Trotta; Jordi Camps; Alejandro Chacón; Antonio Espinosa; Marta Gut; Ivo Gut; Simon Heath; Sergi Beltran

As whole genome sequencing becomes cheaper and faster, it will progressively substitute targeted next‐generation sequencing as standard practice in research and diagnostics. However, computing cost–performance ratio is not advancing at an equivalent rate. Therefore, it is essential to evaluate the robustness of the variant detection process taking into account the computing resources required. We have benchmarked six combinations of state‐of‐the‐art read aligners (BWA‐MEM and GEM3) and variant callers (FreeBayes, GATK HaplotypeCaller, SAMtools) on whole genome and whole exome sequencing data from the NA12878 human sample. Results have been compared between them and against the NIST Genome in a Bottle (GIAB) variants reference dataset. We report differences in speed of up to 20 times in some steps of the process and have observed that SNV, and to a lesser extent InDel, detection is highly consistent in 70% of the genome. SNV, and especially InDel, detection is less reliable in 20% of the genome, and almost unfeasible in the remaining 10%. These findings will aid in choosing the appropriate tools bearing in mind objectives, workload, and computing infrastructure available.

european pvm mpi users group meeting on recent advances in parallel virtual machine and message passing interface | 1999

Relating the Execution Behaviour with the Structure of the Application

Antonio Espinosa; F. Parcerisa; Tomàs Margalef; Emilio Luque

Traditional parallel programming forces the programmer to understand the enormous amount of performance information obtained from the execution of a program. In this paper, we show how the use of KappaPi automatic analysis tool helps the programmers of applications to avoid this difficult task. In the last stage of the analysis we discuss the possibilities of establishing relationships between the performance information found and the programming structure of the application.

The Journal of Supercomputing | 2012

Analysis and improvement of map-reduce data distribution in read mapping applications

Antonio Espinosa; Porfidio Hernández; Juan C. Moure; J. Protasio; Ana Ripoll

The map-reduce paradigm has shown to be a simple and feasible way of filtering and analyzing large data sets in cloud and cluster systems. Algorithms designed for the paradigm must implement regular data distribution patterns so that appropriate use of resources is ensured. Good scalability and performance on Map-Reduce applications greatly depend on the design of regular intermediate data generation-consumption patterns at the map and reduce phases. We describe the data distribution patterns found in current Map-Reduce read mapping bioinformatics applications and show some data decomposition principles to greatly improve their scalability and performance

international conference on computational science | 2016

GPU-based Pedestrian Detection for Autonomous Driving

V. Campmany; S. Silva; Antonio Espinosa; Juan C. Moure; D. Vzquez; A.M. Lpez

We propose a real-time pedestrian detection system for the embedded Nvidia Tegra X1 GPU-CPU hybrid platform. The detection pipeline is composed by the following state-of-the-art algorithms: features extracted from the input image are Histograms of Local Binary Patterns (LBP) and Histograms of Oriented Gradients (HOG); candidate generation using Pyramidal Sliding Window technique; and classification with Support Vector Machine (SVM). Experimental results show that the Tegra ARM platform is two times more energy efficient than a desktop GPU and at least 8 times faster than a desktop multicore CPU.

Explore More