Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Veronica Gil-Costa is active.

Publication


Featured researches published by Veronica Gil-Costa.


high performance distributed computing | 2010

New caching techniques for web search engines

Mauricio Marin; Veronica Gil-Costa; Carlos Gómez-Pantoja

This paper proposes a cache hierarchy that enables Web search engines to efficiently process user queries. The different caches in the hierarchy are used to store pieces of data which are useful to solve frequent queries. Cached items range from specific data such as query answers to generic data such as segments of index retrieved from secondary memory. The paper also presents a comparative study based on discrete-event simulation and bulk-synchronous parallelism. The studied performance metrics include overall query throughput, single-user query latency and power consumption. In all cases, the results show that the proposed cache hierarchy leads to better performance than a baseline approach built on state of the art caching techniques.


Journal of Discrete Algorithms | 2009

Parallel query processing on distributed clustering indexes

Veronica Gil-Costa; Mauricio Marin; Nora Reyes

Similarity search has been proved suitable for searching in large collections of unstructured data objects. A number of practical index data structures for this purpose have been proposed. All of them have been devised to process single queries sequentially. However, in large-scale systems such as Web Search Engines indexing multi-media content, it is critical to deal efficiently with streams of queries rather than with single queries. In this paper we show how to achieve efficient and scalable performance in this context. To this end we transform a sequential index based on clustering into a distributed one and devise algorithms and optimizations specially tailored to support high-performance parallel query processing.


conference on information and knowledge management | 2007

High-performance distributed inverted files

Mauricio Marin; Veronica Gil-Costa

We present a general method of parallel query processing that allows scalable performance on distributed inverted files. The method allows the realization of a hybrid that combines the advantages of the document and term partitioned inverted files.


parallel computing | 2010

Sync/Async parallel search for the efficient design and construction of web search engines

Mauricio Marin; Veronica Gil-Costa; Carolina Bonacic; Ricardo A. Baeza-Yates; Isaac D. Scherson

A parallel query processing method is proposed for the design and construction of web search engines to efficiently deal with dynamic variations in query traffic. The method allows for the efficient use of different distributed indexing and query processing strategies in server clusters consisting of multiple computational/storage nodes. It also enables a better utilization of local and distributed hardware resources as it automatically re-organizes parallel computations to benefit from the advantages of two mixed modes of operation, namely: a newly proposed synchronous mode and the standard asynchronous computing mode. Switching between modes is facilitated by a round-robin strategy devised to grant each query a fair share of the hardware resources and properly predict query throughput. Performance is evaluated by experimental methods and two case studies serve to show how to develop efficient parallel query processing algorithms for large-scale search engines based on the proposed paradigm.


european conference on parallel processing | 2008

A Search Engine Index for Multimedia Content

Mauricio Marin; Veronica Gil-Costa; Carolina Bonacic

We present a distributed index data structure and algorithms devised to support parallel query processing of multimedia content in search engines. We present a comparative study with a number of data structures used as indexes for metric space databases. Our optimization criteria are based on requirements for high-performance search engines. The main advantages of our proposal are efficient performance with respect to other approaches (sequentially and in parallel), suitable treatment of secondary memory, and support for OpenMP multithreading. We presents experiments for the asynchronous (MPI) and bulk-synchronous (BSP) message passing models of parallel computing showing that in both models our approach outperforms others consistently.


parallel, distributed and network-based processing | 2010

Scheduling Metric-Space Queries Processing on Multi-Core Processors

Veronica Gil-Costa; Ricardo J. Barrientos; Mauricio Marin; Carolina Bonacic

This paper proposes a strategy to organize metricspace query processing in multicore search nodes as understood in the context of search engines running on clusters of computers. The strategy is applied in each search node to process all active queries visiting the node as part of their solution which, in general, for each query is computed from the contribution of each search node. When query traffic is high enough, the proposed strategy assigns one thread to each query and lets them work in a fully asynchronous manner. When query traffic is moderate or low, some threads start to idle so they are put to work on queries being processed by other threads. The strategy solves the associated synchronization problem among threads by switching query processing into a bulk-synchronous mode of operation. This simplifies the dynamic re-organization of threads and overheads are very small with the advantage that the overall work-load is evenly distributed across all threads.


international conference on computational science | 2008

Hybrid Index for Metric Space Databases

Mauricio Marin; Veronica Gil-Costa; Roberto Uribe

We present an index data structure for metric-space databases. The proposed method has the advantage of allowing an efficient use of secondary memory. In the case of index entirely loaded in main memory our strategy achieves competitive performance. Our experimental study shows that the proposed index outperforms other strategies known to be efficient in practice. A valuable feature of the proposal is that the index can be dynamically updated once constructed.


applications and theory of petri nets | 2012

Capacity planning for vertical search engines: an approach based on coloured petri nets

Veronica Gil-Costa; Jair Lobos; Alonso Inostrosa-Psijas; Mauricio Marin

This paper proposes a Colored Petri Net model capturing the behaviour of vertical search engines. In such systems a query submitted by a user goes through different stages and can be handled by three different kinds of nodes. The proposed model has a modular design that enables accommodation of alternative/additional search engine components. A performance evaluation study is presented to illustrate the use of the model and it shows that the proposed model is suitable for rapid exploration of different scenarios and determination of feasible search engine configurations.


winter simulation conference | 2014

DEVs modeling of large scale web search engines

Alonso Inostrosa-Psijas; Gabriel A. Wainer; Veronica Gil-Costa; Mauricio Marin

Modeling large scale Web Search Engines (WSEs) is a complex task. It involves many issues such as representing users behavior, query traffic, several strategies and heuristics to improve query response time, etc. Typically, WSEs are composed of several services deployed in data centers, which must interact to get the best document results to user queries. Additionally, hardware specification like multithreading and network communications have to be taken into account. In this paper, we propose to model a servicebased WSE using the Discrete Event System Specification (DEVS) formalism, which is one of the most powerful methodologies for discrete event systems. We validate our proposed model against an actual MPI implementation of the WSE and a process oriented simulation. We evaluate the accuracy of the proposed model by evaluating metrics such as query throughput and we show that there is no relevant differences, just small fluctuations of less than 4%.


international conference on parallel processing | 2010

Distributing a Metric-Space Search Index onto Processors

Mauricio Marin; Flavio Ferrarotti; Veronica Gil-Costa

This paper studies the problem of distributing a metric-space search index based on compact clustering onto a set of distributed memory processors. The aim is enabling efficient similarity search in large-scale Web search engines. The index data structure is composed of a set of clusters enclosing the database objects and we propose distribution methods based on two different solution approaches. The first one makes use of specific knowledge about the work-load generated by user queries. Here the challenge is how to represent and use such a knowledge into a method capable of producing a cluster distribution leading to high performance. The second one follows a novel direction by completely disregarding user behavior to look instead at the relationships among the index clusters themselves to decide their placement onto processors. Both methods perform efficiently depending on the context and they are generic enough to be applied to different distributed index data structures for metric-space databases.

Collaboration


Dive into the Veronica Gil-Costa's collaboration.

Top Co-Authors

Avatar

Carolina Bonacic

Complutense University of Madrid

View shared research outputs
Top Co-Authors

Avatar

Hermes Senger

Federal University of São Carlos

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Romina Molina

National University of San Luis

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Cesar Marcondes

Federal University of São Carlos

View shared research outputs
Top Co-Authors

Avatar

Marcela Printista

National University of San Luis

View shared research outputs
Top Co-Authors

Avatar

Michel J. Mizrahi

University of Buenos Aires

View shared research outputs
Researchain Logo
Decentralizing Knowledge