José M. Sempere
Polytechnic University of Valencia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by José M. Sempere.
Nucleic Acids Research | 2011
Carlos Llorens; Ricardo Futami; Laura Covelli; Laura Domínguez-Escribá; Jose M. Viu; Daniel Tamarit; José Aguilar-Rodríguez; Miguel Vicente-Ripolles; Gonzalo Fuster; Guillermo P. Bernet; Florian Maumus; Alfonso Muñoz-Pomer; José M. Sempere; Amparo Latorre; Andrés Moya
This article introduces the second release of the Gypsy Database of Mobile Genetic Elements (GyDB 2.0): a research project devoted to the evolutionary dynamics of viruses and transposable elements based on their phylogenetic classification (per lineage and protein domain). The Gypsy Database (GyDB) is a long-term project that is continuously progressing, and that owing to the high molecular diversity of mobile elements requires to be completed in several stages. GyDB 2.0 has been powered with a wiki to allow other researchers participate in the project. The current database stage and scope are long terminal repeats (LTR) retroelements and relatives. GyDB 2.0 is an update based on the analysis of Ty3/Gypsy, Retroviridae, Ty1/Copia and Bel/Pao LTR retroelements and the Caulimoviridae pararetroviruses of plants. Among other features, in terms of the aforementioned topics, this update adds: (i) a variety of descriptions and reviews distributed in multiple web pages; (ii) protein-based phylogenies, where phylogenetic levels are assigned to distinct classified elements; (iii) a collection of multiple alignments, lineage-specific hidden Markov models and consensus sequences, called GyDB collection; (iv) updated RefSeq databases and BLAST and HMM servers to facilitate sequence characterization of new LTR retroelement and caulimovirus queries; and (v) a bibliographic server. GyDB 2.0 is available at http://gydb.org.
Acta Informatica | 2003
Juan Castellanos; Carlos Martín-Vide; Victor Mitrana; José M. Sempere
Abstract. In this paper we consider networks of evolutionary processors as language generating and computational devices. When the filters are regular languages one gets the computational power of Turing machines with networks of size at most six, depending on the underlying graph. When the filters are defined by random context conditions, we obtain an incomparability result with the families of regular and context-free languages. Despite their simplicity, we show how the latter networks might be used for solving an NP-complete problem, namely the “3-colorability problem”, in linear time and linear resources (nodes, symbols, rules).
international work conference on artificial and natural neural networks | 2001
Juan Castellanos; Carlos Martín-Vide; Victor Mitrana; José M. Sempere
We propose a computational device based on evolutionary rules and communication within a network, similar to that introduced in [4], called network of evolutionary processors. An NP-complete problem is solved by networks of evolutionary processors of linear size in linear time. Some furher directions of research are finally discussed.
international colloquium on grammatical inference | 1994
José M. Sempere; Pedro García
Even Linear Language class is a subclass of context-free class. In this work we propose a characterization of this class using a relation of finite index. Theorems are provided in order to prove the consistence of the characterization. Finally, we propose a method to learn this class using a reduction to the problem of learning regular languages.
systems man and cybernetics | 2004
Damián López; José M. Sempere; Pedro Alfaro García
In this paper, we study the notion of k-reversibility and k-testability when regular tree languages are involved. We present an inference algorithm for learning a k-testable tree language that runs in polynomial time with respect to the size of the sample used. We also study the tree language classes in relation to other well known ones, and some properties of these languages are proven.
Lecture Notes in Computer Science | 2003
Peter Leupold; Victor Mitrana; José M. Sempere
We consider two types of languages defined by a string through iterative factor duplications, inspired by the process of tandem repeats production in the evolution of DNA. We investigate some decidability matters concerning the unbounded duplication languages and then fix the place of bounded duplication languages in the Chomsky hierarchy by showing that all these languages are context-free. We give some conditions for the non-regularity of these languages. Finally, we discuss some open problems and directions for further research.
international colloquium on grammatical inference | 2000
Henning Fernau; José M. Sempere
We discuss two versatile methods which can be used to transfer learnability results from one language class to another. We apply these methodologies to three learning paradigms: (1) Learning in the limit, (2) Morphic generator grammar inference, and (3) Query learning.
international colloquium on grammatical inference | 1998
José M. Sempere; G. Nagaraja
A method to infer a subclass of linear languages from positive structural information (i.e. skeletons) is presented. The characterization of the class and the analysis of the time and space complexity of the algorithm is exposed too. The new class, Terminal and Structural Distinguishable Linear Languages (TSDLL), is defined through an algebraic characterization and a pumping lemma. We prove that the proposed algorithm correctly identifies any TSDL language in the limit if structural information is presented. Furthermore, we give a definition of a characteristic structural set for any target grammar. Finally we present the conclusions of the work and some guidelines for future works.
international colloquium on grammatical inference | 2006
Piedachu Peris; Damián López; Marcelino Campos; José M. Sempere
The rapid growth of protein sequence databases is exceeding the capacity of biochemically and structurally characterizing new proteins. Therefore, it is very important the development of tools to locate, within protein sequences, those subsequences with an associated function or specific feature. In our work, we propose a method to predict one of those functional motifs (coiled coil), related with protein interaction. Our approach uses even linear languages inference to obtain a transductor which will be used to label unknown sequences. The experiments carried out show that our method outperforms the results of previous approaches.
International Journal of Pattern Recognition and Artificial Intelligence | 2000
Damián López; José M. Sempere; Pedro García
To undertake a syntactic approach to a pattern recognition problem, it is necessary to have good grammatical models as well as good parsing algorithms that allow distorted samples to be classified. There are several methods that obtain, by taking two trees as input, the editing distance between them. In the following work, a polynomial time algorithm which processes the distance between a tree and a tree automaton is presented. This measure can be used in pattern recognition problems as an error model inside a syntactic classifier.