Humberto González-Díaz
University of the Basque Country
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Humberto González-Díaz.
Proteomics | 2008
Humberto González-Díaz; Yenny González-Díaz; Lourdes Santana; Florencio M. Ubeira; Eugenio Uriarte
Describing the connectivity of chemical and/or biological systems using networks is a straight gate for the introduction of mathematical tools in proteomics. Networks, in some cases even very large ones, are simple objects that are composed at least by nodes and edges. The nodes represent the parts of the system and the edges geometric and/or functional relationships between parts. In proteomics, amino acids, proteins, electrophoresis spots, polypeptidic fragments, or more complex objects can play the role of nodes. All of these networks can be numerically described using the so‐called Connectivity Indices (CIs). The transformation of graphs (a picture) into CIs (numbers) facilitates the manipulation of information and the search for structure‐function relationships in Proteomics. In this work, we review and comment on the challenges and new trends in the definition and applications of CIs in Proteomics. Emphasis is placed on 1‐D‐CIs for DNA and protein sequences, 2‐D‐CIs for RNA secondary structures, 3‐D‐topographic indices (TPGIs) for protein function annotation without alignment, 2‐D‐CIs and 3‐D‐TPGIs for the study of drug‐protein or drug‐RNA quantitative structure‐binding relationships, and pseudo 3‐D‐CIs for protein surface molecular recognition. We also focus on CIs to describe Protein Interaction Networks or RNA co‐expression networks. 2‐D‐CIs for patient blood proteome 2‐DE maps or mass spectra are also covered.
Bioorganic & Medicinal Chemistry | 2008
Francisco J. Prado-Prado; Humberto González-Díaz; Octavio Martínez de la Vega; Florencio M. Ubeira; Kuo-Chen Chou
Several pathogen parasite species show different susceptibilities to different antiparasite drugs. Unfortunately, almost all structure-based methods are one-task or one-target Quantitative Structure-Activity Relationships (ot-QSAR) that predict the biological activity of drugs against only one parasite species. Consequently, multi-tasking learning to predict drugs activity against different species by a single model (mt-QSAR) is vitally important. In the two previous works of the present series we reported two single mt-QSAR models in order to predict the antimicrobial activity against different fungal (Bioorg. Med. Chem.2006, 14, 5973-5980) or bacterial species (Bioorg. Med. Chem.2007, 15, 897-902). These mt-QSARs offer a good opportunity (unpractical with ot-QSAR) to construct drug-drug similarity Complex Networks and to map the contribution of sub-structures to function for multiple species. These possibilities were unattended in our previous works. In the present work, we continue this series toward other important direction of chemotherapy (antiparasite drugs) with the development of an mt-QSAR for more than 500 drugs tested in the literature against different parasites. The data were processed by Linear Discriminant Analysis (LDA) classifying drugs as active or non-active against the different tested parasite species. The model correctly classifies 212 out of 244 (87.0%) cases in training series and 207 out of 243 compounds (85.4%) in external validation series. In order to illustrate the performance of the QSAR for the selection of active drugs we carried out an additional virtual screening of antiparasite compounds not used in training or predicting series; the model recognized 97 out of 114 (85.1%) of them. We also give the procedures to construct back-projection maps and to calculate sub-structures contribution to the biological activity. Finally, we used the outputs of the QSAR to construct, by the first time, a multi-species Complex Networks of antiparasite drugs. The network predicted has 380 nodes (compounds), 634 edges (pairs of compounds with similar activity). This network allows us to cluster different compounds and identify on average three known compounds similar to a new query compound according to their profile of biological activity. This is the first attempt to calculate probabilities of antiparasitic action of drugs against different parasites.
Current Topics in Medicinal Chemistry | 2008
Humberto González-Díaz; Francisco J. Prado-Prado; Florencio M. Ubeira
The method MARCH-INSIDE (MARkovian CHemicals IN SIlico DEsign) is a simple but efficient computational approach to the study of Quantitative Structure-Activity Relationships (QSAR) in Medicinal Chemistry. The method uses the theory of Markov Chains to generate parameters that numerically describe the chemical structure of drugs and drug targets. This approach generates two principal types of parameters Stochastic Topological Indices (sto-TIs) and stochastic 3D-Topographic Indices (sto-TPGIs). The use of these parameters allows the rapid collection, annotation, retrieval, comparison and mining of molecular and macromolecular chemical structures within large databases. In the work described here, we review and comment on the several applications of MARCH-INSIDE to the Medicinal Chemistry of Antimicrobial agents as well as their molecular targets. First we revised the use of classic sto-TIs to predict antiparasite compounds for the treatment of Fascioliasis. Next, we revised the use of chiral sto-TIs (sto-CTIs) to predict new antibacterial, antiviral and anti-coccidial compounds. After that, we review multi-target sto-TIs (mt-sto-TIs), which unifying QSAR models predicting antifungal, antibacterial, or anti-parasite drugs with multiple targets (microbial species). We also discussed the uses of mt-sto-TIs to assemble drug-drug similarity Complex Networks of antimicrobial compounds based on molecular structure. Last, we review the use of MARCH-INSIDE to generate macromolecular TIs and TPGIs for proteins or RNA targets for antimicrobial drugs.
FEBS Letters | 2006
Guillermin Agüero-Chapin; Humberto González-Díaz; Reinaldo Molina; Javier T. Varona-Santos; Eugenio Uriarte; Yenny González-Díaz
The development of 2D graph‐theoretic representations for DNA sequences was very important for qualitative and quantitative comparison of sequences. Calculation of numeric features for these representations is useful for DNA–QSAR studies. Most of all graph‐theoretic representations identify each one of the four bases with a unitary walk in one axe direction in the 2D space. In the case of proteins, twenty amino acids instead of four bases have to be considered. This fact has limited the introduction of useful 2D Cartesian representations and the corresponding sequences descriptors to encode protein sequence information. In this study, we overcome this problem grouping amino acids into four groups: acid, basic, polar and non‐polar amino acids. The identification of each group with one of the four axis directions determines a novel 2D representation and numeric descriptors for proteins sequences. Afterwards, a Markov model has been used to calculate new numeric descriptors of the protein sequence. These descriptors are called herein the sequence 2D coupling numbers (ζ k ). In this work, we calculated the ζ k values for 108 sequences of different polygalacturonases (PGs) and for 100 sequences of other proteins. A Linear Discriminant Analysis model derived here (PG = 5.36 · ζ 1 − 3.98 · ζ 3 − 42.21) successfully discriminates between PGs and other proteins. The model correctly classified 100% of a subset of 81 PGs and 75 non‐PG proteins sequences used to train the model. The model also correctly classified 51 out of 52 (98.07%) of proteins sequences used as external validation series. The uses of different group of amino acids and/or axes orientation give different results, so it is suggested to be explored for other databases. Finally, to illustrates the use of the model we report the isolation and prediction of the PG action for a novel sequence (AY908988) isolated by our group from Psidium guajava L. This prediction coincides very well with sequence alignment results found by the BLAST methodology. These findings illustrate the possibilities of the sequence descriptors derived for this novel 2D sequence representation in proteins sequence QSAR studies.
Journal of Medicinal Chemistry | 2008
Lourdes Santana; Humberto González-Díaz; Elías Quezada; Eugenio Uriarte; Matilde Yáñez; Dolores Viña; Francisco Orallo
The work provides a new model for the prediction of the MAO-A and -B inhibitor activity by the use of combined complex networks and QSAR methodologies. On the basis of the obtained model, we prepared and assayed 33 coumarin derivatives, and the theoretical prediction was compared with the experimental activity data. The model correctly predicted 27 compounds, and most of the active derivatives showed IC 50 values in the muM-nM range against both the MAO-A and MAO-B isoforms. Compound 14 shows the same MAO-A inhibitory activity (IC 50 = 7.2 nM), as clorgyline used as a reference inhibitor and has the highest MAO-A specificity (1000-fold higher compared to MAO-B). On the other hand, compounds 24 (IC 50 = 1.2 nM) and 28 (IC 50 = 1.5 nM) show higher activity than selegiline (IC 50 = 19.6 nM) and high MAO-B selectivity with 100-fold and 1600-fold inhibition levels, with respect to the MAO-A isoform.
Journal of Computational Chemistry | 2008
Humberto González-Díaz; Francisco J. Prado-Prado
There are many pathogen microbial species with very different antimicrobial drugs susceptibility. In this work, we selected pairs of antifungal drugs with similar/dissimilar species predicted‐activity profile and represented it as a large network, which may be used to identify drugs with similar mechanism of action. Computational chemistry prediction of the biological activity based on quantitative structure–activity relationships (QSAR) susbtantially increases the potentialities of this kind of networks, avoiding time and resource‐consuming experiments. Unfortunately, most QSAR models are unspecific or predict activity against only one species. To solve this problem we developed a multispecies QSAR classification model, in which the outputs were the inputs of the aforementioned network. Overall model classification accuracy was 87.0% (161/185 compounds) in training, 83.4% (50/61) in validation, and 83.7% for 288 additional antifungal compounds used to extend model validation for network construction. The network predicted has 59 nodes (compounds), 648 edges (pairs of compounds with similar activity), low coverage density d = 37.8%, and distribution more close to normal than to exponential. These results are more characteristic of a not‐overestimated random network, clustering different drug mechanisms of actions, than of a less useful power law network with few mechanisms (network hubs).
Bioorganic & Medicinal Chemistry | 2009
Francisco J. Prado-Prado; Octavio Martínez de la Vega; Eugenio Uriarte; Florencio M. Ubeira; Kuo-Chen Chou; Humberto González-Díaz
One limitation of almost all antiviral Quantitative Structure-Activity Relationships (QSAR) models is that they predict the biological activity of drugs against only one species of virus. Consequently, the development of multi-tasking QSAR models (mt-QSAR) to predict drugs activity against different species of virus is of the major vitally important. These mt-QSARs offer also a good opportunity to construct drug-drug Complex Networks (CNs) that can be used to explore large and complex drug-viral species databases. It is known that in very large CNs we can use the Giant Component (GC) as a representative sub-set of nodes (drugs) and but the drug-drug similarity function selected may strongly determines the final network obtained. In the three previous works of the present series we reported mt-QSAR models to predict the antimicrobial activity against different fungi [Gonzalez-Diaz, H.; Prado-Prado, F. J.; Santana, L.; Uriarte, E. Bioorg.Med.Chem.2006, 14, 5973], bacteria [Prado-Prado, F. J.; Gonzalez-Diaz, H.; Santana, L.; Uriarte E. Bioorg.Med.Chem.2007, 15, 897] or parasite species [Prado-Prado, F.J.; González-Díaz, H.; Martinez de la Vega, O.; Ubeira, F.M.; Chou K.C. Bioorg.Med.Chem.2008, 16, 5871]. However, including these works, we do not found any report of mt-QSAR models for antivirals drug, or a comparative study of the different GC extracted from drug-drug CNs based on different similarity functions. In this work, we used Linear Discriminant Analysis (LDA) to fit a mt-QSAR model that classify 600 drugs as active or non-active against the 41 different tested species of virus. The model correctly classifies 143 of 169 active compounds (specificity=84.62%) and 119 of 139 non-active compounds (sensitivity=85.61%) and presents overall training accuracy of 85.1% (262 of 308 cases). Validation of the model was carried out by means of external predicting series, classifying the model 466 of 514, 90.7% of compounds. In order to illustrate the performance of the model in practice, we develop a virtual screening recognizing the model as active 92.7%, 102 of 110 antivirus compounds. These compounds were never use in training or predicting series. Next, we obtained and compared the topology of the CNs and their respective GCs based on Euclidean, Manhattan, Chebychey, Pearson and other similarity measures. The GC of the Manhattan network showed the more interesting features for drug-drug similarity search. We also give the procedure for the construction of Back-Projection Maps for the contribution of each drug sub-structure to the antiviral activity against different species.
Bioorganic & Medicinal Chemistry | 2010
Francisco J. Prado-Prado; Xerardo García-Mera; Humberto González-Díaz
There are many of pathogen parasite species with different susceptibility profile to antiparasitic drugs. Unfortunately, almost QSAR models predict the biological activity of drugs against only one parasite species. Consequently, predicting the probability with which a drug is active against different species with a single unify model is a goal of the major importance. In so doing, we use Markov Chains theory to calculate new multi-target spectral moments to fit a QSAR model that predict by the first time a mt-QSAR model for 500 drugs tested in the literature against 16 parasite species and other 207 drugs no tested in the literature using spectral moments. The data was processed by linear discriminant analysis (LDA) classifying drugs as active or non-active against the different tested parasite species. The model correctly classifies 311 out of 358 active compounds (86.9%) and 2328 out of 2577 non-active compounds (90.3%) in training series. Overall training performance was 89.9%. Validation of the model was carried out by means of external predicting series. In these series the model classified correctly 157 out 190, 82.6% of antiparasitic compounds and 1151 out of 1277 non-active compounds (90.1%). Overall predictability performance was 89.2%. In addition we developed four types of non Linear Artificial neural networks (ANN) and we compared with the mt-QSAR model. The improved ANN model had an overall training performance was 87%. The present work report the first attempts to calculate within a unify framework probabilities of antiparasitic action of drugs against different parasite species based on spectral moment analysis.
Molecular Pharmaceutics | 2009
Dolores Viña; Eugenio Uriarte; Francisco Orallo; Humberto González-Díaz
There are many drugs described with very different affinity to a large number of receptors. In this work, we selected drug-receptor pairs (DRPs) of affinity/nonaffinity drugs to similar/dissimilar receptors and we represented them as a large network, which may be used to identify drugs that can act on a receptor. Computational chemistry prediction of the biological activity based on quantitative structure-activity relationships (QSAR) substantially increases the potentialities of this kind of networks avoiding time- and resource-consuming experiments. Unfortunately, most QSAR models are unspecific or predict activity against only one receptor. To solve this problem, we developed here a multitarget QSAR (mt-QSAR) classification model. Overall model classification accuracy was 72.25% (1390/1924 compounds) in training, 72.28% (459/635) in cross-validation. Outputs of this mt-QSAR model were used as inputs to construct a network. The observed network has 1735 nodes (DRPs), 1754 edges or pairs of DRPs with similar drug-target affinity (sPDRPs), and low coverage density d = 0.12%. The predicted network has 1735 DRPs, 1857 sPDRPs, and also low coverage density d = 0.12%. After an edge-to-edge comparison (chi-square = 9420.3; p < 0.005), we have demonstrated that the predicted network is significantly similar to the one observed and both have a distribution closer to exponential than to normal.
Journal of Computational Chemistry | 2007
Maykel Cruz-Monteagudo; Humberto González-Díaz; Guillermin Agüero-Chapin; Lourdes Santana; Fernanda Borges; Elena Rosa Dominguez; Gianni Podda; Eugenio Uriarte
Predicting tissue and environmental distribution of chemicals is of major importance for environmental and life sciences. Most of the molecular descriptors used in computational prediction of chemicals partition behavior consider molecular structure but ignore the nature of the partition system. Consequently, computational models derived up‐to‐date are restricted to the specific system under study. Here, a free energy‐based descriptor (ΔGk) is introduced, which circumvent this problem. Based on ΔGk, we developed for the first time a single linear classification model to predict the partition behavior of a broad number of structurally diverse drugs and other chemicals (1300) for 38 different partition systems of biological and environmental significance. The model presented training/predicting set accuracies of 91.79/88.92%. Parametrical assumptions were checked. Desirability analysis was used to explore the levels of the predictors that produce the most desirable partition properties. Finally, inversion of the partition direction for each one of the 38 partition systems evidences that our models correctly classified 89.08% of compounds with an uncertainty of only ±0.17% independently of the direction of the partition process used to seek the model. Other 10 different classification models (linear, neural networks, and genetic algorithms) were also tested for the same purposes. None of these computational models favorably compare with respect to the linear model indicating that our approach capture the main aspects that govern chemicals partition in different systems.