Rafael C. Carrasco
University of Alicante
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Rafael C. Carrasco.
international colloquium on grammatical inference | 1994
Rafael C. Carrasco; Jose Oncina
We propose a new algorithm which allows for the identification of any stochastic deterministic regular language as well as the determination of the probabilities of the strings in the language. The algorithm builds the prefix tree acceptor from the sample set and merges systematically equivalent states. Experimentally, it proves very fast and the time needed grows only linearly with the size of the sample set.
IEEE Transactions on Pattern Analysis and Machine Intelligence | 2005
Enrique Vidal; Franck Thollard; C. de la Higuera; Francisco Casacuberta; Rafael C. Carrasco
Probabilistic finite-state machines are used today in a variety of areas in pattern recognition or in fields to which pattern recognition is linked. In part I of this paper, we surveyed these objects and studied their properties. In this part, we study the relations between probabilistic finite-state automata and other well-known devices that generate strings like hidden Markov models and n-grams and provide theorems, algorithms, and properties that represent a current state of the art of these objects.
Theoretical Informatics and Applications | 1999
Rafael C. Carrasco; Jose Oncina
In this paper, the identification of stochastic regular languages is addressed. For this purpose, we propose a class of algorithms which allow for the identification of the structure of the minimal stochastic automaton generating the language. It is shown that the time needed grows only linearly with the size of the sample set and a measure of the complexity of the task is provided. Experimentally, our implementation proves very fast for application purposes.
Pattern Recognition Letters | 1996
Luisa Micó; Jose Oncina; Rafael C. Carrasco
The recently introduced algorithm LAESA finds the nearest neighbour prototype in a metric space. The average number of distances computed in the algorithm does not depend on the number of prototypes but it shows linear space and time complexities. In this paper, a new algorithm (TLAESA) is proposed which has a sublinear time complexity and keeps the other features unchanged.
processing of the portuguese language | 2006
Carme Armentano-Oller; Rafael C. Carrasco; Antonio M. Corbí-Bellot; Mikel L. Forcada; Mireia Ginestí-Rosell; Sergio Ortiz-Rojas; Juan Antonio Pérez-Ortiz; Gema Ramírez-Sánchez; Felipe Sánchez-Martínez; Miriam A. Scalco
This paper describes the current status of development of an open-source shallow-transfer machine translation (MT) system for the [European] Portuguese
Computational Linguistics | 2002
Rafael C. Carrasco; Mikel L. Forcada
\leftrightarrow
Neural Computation | 1995
Mikel L. Forcada; Rafael C. Carrasco
Spanish language pair, developed using the OpenTrad Apertium MT toolbox (www.apertium.org). Apertium uses finite-state transducers for lexical processing, hidden Markov models for part-of-speech tagging, and finite-state-based chunking for structural transfer, and is based on a simple rationale: to produce fast, reasonably intelligible and easily correctable translations between related languages, it suffices to use a MT strategy which uses shallow parsing techniques to refine word-for-word MT. This paper briefly describes the MT engine, the formats it uses for linguistic data, and the compilers that convert these data into an efficient format used by the engine, and then goes on to describe in more detail the pilot Portuguese
Neural Computation | 2000
Rafael C. Carrasco; Mikel L. Forcada; M. Ángeles Valdés-Muòoz; Ramón P. Ñeco
\leftrightarrow
international colloquium on grammatical inference | 1998
Rafael C. Carrasco; Jose Oncina; Jorge Calera-Rubio
Spanish linguistic data.
Theoretical Informatics and Applications | 1997
Rafael C. Carrasco
Daciuk et al. [Computational Linguistics 26(1):316 (2000)] describe a method for constructing incrementally minimal, deterministic, acyclic finite-state automata (dictionaries) from sets of strings. But acyclic finite-state automata have limitations: For instance, if one wants a linguistic application to accept all possible integer numbers or Internet addresses, the corresponding finite-state automaton has to be cyclic. In this article, we describe a simple and equally efficient method for modifying any minimal finite-state automaton (be it acyclic or not) so that a string is added to or removed from the language it accepts; both operations are very important when dictionary maintenance is performed and solve the dictionary construction problem addressed by Daciuk et al. as a special case. The algorithms proposed here may be straightforwardly derived from the customary textbook constructions for the intersection and the complementation of finite-state automata; the algorithms exploit the special properties of the automata resulting from the intersection operation when one of the finite-state automata accepts a single string.