Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Stephen J. Barigye is active.

Publication


Featured researches published by Stephen J. Barigye.


Journal of Computational Chemistry | 2014

QuBiLS‐MIDAS: A parallel free‐software for molecular descriptors computation based on multilinear algebraic maps

César R. García-Jacas; Yovani Marrero-Ponce; Liesner Acevedo-Martínez; Stephen J. Barigye; José R. Valdés-Martiní; Ernesto Contreras-Torres

The present report introduces the QuBiLS‐MIDAS software belonging to the ToMoCoMD‐CARDD suite for the calculation of three‐dimensional molecular descriptors (MDs) based on the two‐linear (bilinear), three‐linear, and four‐linear (multilinear or N‐linear) algebraic forms. Thus, it is unique software that computes these tensor‐based indices. These descriptors, establish relations for two, three, and four atoms by using several (dis‐)similarity metrics or multimetrics, matrix transformations, cutoffs, local calculations and aggregation operators. The theoretical background of these N‐linear indices is also presented. The QuBiLS‐MIDAS software was developed in the Java programming language and employs the Chemical Development Kit library for the manipulation of the chemical structures and the calculation of the atomic properties. This software is composed by a desktop user‐friendly interface and an Abstract Programming Interface library. The former was created to simplify the configuration of the different options of the MDs, whereas the library was designed to allow its easy integration to other software for chemoinformatics applications. This program provides functionalities for data cleaning tasks and for batch processing of the molecular indices. In addition, it offers parallel calculation of the MDs through the use of all available processors in current computers. The studies of complexity of the main algorithms demonstrate that these were efficiently implemented with respect to their trivial implementation. Lastly, the performance tests reveal that this software has a suitable behavior when the amount of processors is increased. Therefore, the QuBiLS‐MIDAS software constitutes a useful application for the computation of the molecular indices based on N‐linear algebraic maps and it can be used freely to perform chemoinformatics studies.


Molecular Diversity | 2015

IMMAN: free software for information theory-based chemometric analysis

Ricardo W. Pino Urias; Stephen J. Barigye; Yovani Marrero-Ponce; César R. García-Jacas; José R. Valdés-Martiní; Facundo Pérez-Giménez

The features and theoretical background of a new and free computational program for chemometric analysis denominated IMMAN (acronym for Information theory-based CheMoMetrics ANalysis) are presented. This is multi-platform software developed in the Java programming language, designed with a remarkably user-friendly graphical interface for the computation of a collection of information-theoretic functions adapted for rank-based unsupervised and supervised feature selection tasks. A total of 20 feature selection parameters are presented, with the unsupervised and supervised frameworks represented by 10 approaches in each case. Several information-theoretic parameters traditionally used as molecular descriptors (MDs) are adapted for use as unsupervised rank-based feature selection methods. On the other hand, a generalization scheme for the previously defined differential Shannon’s entropy is discussed, as well as the introduction of Jeffreys information measure for supervised feature selection. Moreover, well-known information-theoretic feature selection parameters, such as information gain, gain ratio, and symmetrical uncertainty are incorporated to the IMMAN software (http://mobiosd-hub.com/imman-soft/), following an equal-interval discretization approach. IMMAN offers data pre-processing functionalities, such as missing values processing, dataset partitioning, and browsing. Moreover, single parameter or ensemble (multi-criteria) ranking options are provided. Consequently, this software is suitable for tasks like dimensionality reduction, feature ranking, as well as comparative diversity analysis of data matrices. Simple examples of applications performed with this program are presented. A comparative study between IMMAN and WEKA feature selection tools using the Arcene dataset was performed, demonstrating similar behavior. In addition, it is revealed that the use of IMMAN unsupervised feature selection methods improves the performance of both IMMAN and WEKA supervised algorithms.Graphical abstractGraphic representation for Shannon’s distribution of MD calculating software.


Molecular Diversity | 2014

Trends in information theory-based chemical structure codification

Stephen J. Barigye; Yovani Marrero-Ponce; Facundo Pérez-Giménez; Danail Bonchev

This report offers a chronological review of the most relevant applications of information theory in the codification of chemical structure information, through the so-called information indices. Basically, these are derived from the analysis of the statistical patterns of molecular structure representations, which include primitive global chemical formulae, chemical graphs, or matrix representations. Finally, new approaches that attempt to go “back to the roots” of information theory, in order to integrate other information-theoretic measures in chemical structure coding are discussed.


Journal of Computational Chemistry | 2013

Relations frequency hypermatrices in mutual, conditional and joint entropy-based information indices.

Stephen J. Barigye; Yovani Marrero-Ponce; Yoan Martínez-López; Francisco Torrens; Luis Manuel Artiles-Martínez; Ricardo W. Pino-Urias; Oscar Martínez-Santiago

Graph‐theoretic matrix representations constitute the most popular and significant source of topological molecular descriptors (MDs). Recently, we have introduced a novel matrix representation, named the duplex relations frequency matrix, F, derived from the generalization of an incidence matrix whose row entries are connected subgraphs of a given molecular graph G. Using this matrix, a series of information indices (IFIs) were proposed. In this report, an extension of F is presented, introducing for the first time the concept of a hypermatrix in graph‐theoretic chemistry. The hypermatrix representation explores the n‐tuple participation frequencies of vertices in a set of connected subgraphs of G. In this study we, however, focus on triple and quadruple participation frequencies, generating triple and quadruple relations frequency matrices, respectively. The introduction of hypermatrices allows us to redefine the recently proposed MDs, that is, the mutual, conditional, and joint entropy‐based IFIs, in a generalized way. These IFIs are implemented in GT‐STAF (acronym for Graph Theoretical Thermodynamic STAte Functions), a new module of the TOMOCOMD‐CARDD program. Information theoretic‐based variability analysis of the proposed IFIs suggests that the use of hypermatrices enhances the entropy and, hence, the variability of the previously proposed IFIs, especially the conditional and mutual entropy based IFIs. The predictive capacity of the proposed IFIs was evaluated by the analysis of the regression models, obtained for physico‐chemical properties the partition coefficient (Log P) and the specific rate constant (Log K) of 34 derivatives of 2‐furylethylene. The statistical parameters, for the best models obtained for these properties, were compared to those reported in the literature depicting better performance. This result suggests that the use of the hypermatrix‐based approach, in the redefinition of the previously proposed IFIs, avails yet other valuable tools beneficial in QSPR studies and diversity analysis.


Current Computer - Aided Drug Design | 2013

Shannon's, mutual, conditional and joint entropy information indices: generalization of global indices defined from local vertex invariants.

Stephen J. Barigye; Yovani Marrero-Ponce; Oscar Martínez Santiago; Yoan Martínez López; Facundo Pérez-Giménez; Francisco Torrens

A new mathematical approach is proposed in the definition of molecular descriptors (MDs) based on the application of information theory concepts. This approach stems from a new matrix representation of a molecular graph (G) which is derived from the generalization of an incidence matrix whose row entries correspond to connected subgraphs of a given G, and the calculation of the Shannons entropy, the negentropy and the standardized information content, plus for the first time, the mutual, conditional and joint entropy-based MDs associated with G. We also define strategies that generalize the definition of global or local invariants from atomic contributions (local vertex invariants, LOVIs), introducing related metrics (norms), means and statistical invariants. These invariants are applied to a vector whose components express the atomic information content calculated using the Shannons, mutual, conditional and joint entropybased atomic information indices. The novel information indices (IFIs) are implemented in the program TOMOCOMDCARDD. A principal component analysis reveals that the novel IFIs are capable of capturing structural information not codified by IFIs implemented in the software DRAGON. A comparative study of the different parameters (e.g. subgraph orders and/or types, invariants and class of MDs) used in the definition of these IFIs reveals several interesting results. The mutual entropy-based indices give the best correlation results in modeling of a physicochemical property, namely the partition coefficient of the 34 derivatives of 2-furylethylenes, among the classes of indices investigated in this study. In a comparison with classical MDs it is demonstrated that the new IFIs give good results for various QSPR models.


Journal of Computer-aided Molecular Design | 2012

Derivatives in discrete mathematics: a novel graph-theoretical invariant for generating new 2/3D molecular descriptors. I. Theory and QSPR application

Yovani Marrero-Ponce; Oscar Martínez Santiago; Yoan Martínez López; Stephen J. Barigye; Francisco Torrens

In this report, we present a new mathematical approach for describing chemical structures of organic molecules at atomic-molecular level, proposing for the first time the use of the concept of the derivative (


Journal of Cheminformatics | 2016

Examining the predictive accuracy of the novel 3D N-linear algebraic molecular codifications on benchmark datasets

César R. García-Jacas; Ernesto Contreras-Torres; Yovani Marrero-Ponce; Mario Pupo-Meriño; Stephen J. Barigye; Lisset Cabrera-Leyva


Molecular Informatics | 2015

Multi-Server Approach for High-Throughput Molecular Descriptors Calculation based on Multi-Linear Algebraic Maps.

César R. García-Jacas; Longendri Aguilera-Mendoza; Reisel González-Pérez; Yovani Marrero-Ponce; Liesner Acevedo-Martínez; Stephen J. Barigye; Tatiana Avdeenko

\partial


Sar and Qsar in Environmental Research | 2013

Event-based criteria in GT-STAF information indices: theory, exploratory diversity analysis and QSPR applications

Stephen J. Barigye; Yovani Marrero-Ponce; Y. Martínez López; O. Martínez Santiago; Francisco Torrens; R. Garcia Domenech; Jorge Gálvez


Bioinformatics | 2015

Overlap and diversity in antimicrobial peptide databases: compiling a non-redundant set of sequences

Longendri Aguilera-Mendoza; Yovani Marrero-Ponce; Roberto Tellez-Ibarra; Monica T. Llorente-Quesada; Jesús Salgado; Stephen J. Barigye; Jun Liu

) of a molecular graph (MG) with respect to a given event (E), to obtain a new family of molecular descriptors (MDs). With this purpose, a new matrix representation of the MG, which generalizes graph’s theory’s traditional incidence matrix, is introduced. This matrix, denominated the generalized incidence matrix, Q, arises from the Boolean representation of molecular sub-graphs that participate in the formation of the graph molecular skeleton MG and could be complete (representing all possible connected sub-graphs) or constitute sub-graphs of determined orders or types as well as a combination of these. The Q matrix is a non-quadratic and unsymmetrical in nature, its columns (n) and rows (m) are conditions (letters) and collection of conditions (words) with which the event occurs. This non-quadratic and unsymmetrical matrix is transformed, by algebraic manipulation, to a quadratic and symmetric matrix known as relations frequency matrix, F, which characterizes the participation intensity of the conditions (letters) in the events (words). With F, we calculate the derivative over a pair of atomic nuclei. The local index for the atomic nuclei i, Δi, can therefore be obtained as a linear combination of all the pair derivatives of the atomic nuclei i with all the rest of the j′s atomic nuclei. Here, we also define new strategies that generalize the present form of obtaining global or local (group or atom-type) invariants from atomic contributions (local vertex invariants, LOVIs). In respect to this, metric (norms), means and statistical invariants are introduced. These invariants are applied to a vector whose components are the values Δi for the atomic nuclei of the molecule or its fragments. Moreover, with the purpose of differentiating among different atoms, an atomic weighting scheme (atom-type labels) is used in the formation of the matrix Q or in LOVIs state. The obtained indices were utilized to describe the partition coefficient (Log P) and the reactivity index (Log K) of the 34 derivatives of 2-furylethylenes. In all the cases, our MDs showed better statistical results than those previously obtained using some of the most used families of MDs in chemometric practice. Therefore, it has been demonstrated to that the proposed MDs are useful in molecular design and permit obtaining easier and robust mathematical models than the majority of those reported in the literature. All this range of mentioned possibilities open “the doors” to the creation of a new family of MDs, using the graph derivative, and avail a new tool for QSAR/QSPR and molecular diversity/similarity studies.

Collaboration


Dive into the Stephen J. Barigye's collaboration.

Top Co-Authors

Avatar

Yovani Marrero-Ponce

Universidad San Francisco de Quito

View shared research outputs
Top Co-Authors

Avatar

Matheus P. Freitas

Universidade Federal de Lavras

View shared research outputs
Top Co-Authors

Avatar

Francisco Torrens

Universidad Católica de Valencia San Vicente Mártir

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

César R. García-Jacas

Pontificia Universidad Católica del Ecuador

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Huong Le-Thi-Thu

Vietnam National University

View shared research outputs
Top Co-Authors

Avatar

Mirlaine R. Freitas

Universidade Federal de Lavras

View shared research outputs
Top Co-Authors

Avatar

César R. García-Jacas

Pontificia Universidad Católica del Ecuador

View shared research outputs
Researchain Logo
Decentralizing Knowledge