Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Zakariya Yahya Algamal is active.

Publication


Featured researches published by Zakariya Yahya Algamal.


Expert Systems With Applications | 2015

Penalized logistic regression with the adaptive LASSO for gene selection in high-dimensional cancer classification

Zakariya Yahya Algamal; Muhammad Hisyam Lee

The CBPLR showed superior results in terms of AUR and misclassification rate.In terms of the number of selected genes, the CBPLR outperformed APLR and LASSO.The CBPLR performed remarkably well in stability test.The classification accuracy for the CBPLR method is quite consistent and high. An important application of DNA microarray data is cancer classification. Because of the high-dimensionality problem of microarray data, gene selection approaches are often employed to support the expert systems in diagnostic capability of cancer with high classification accuracy. Penalized logistic regression using the least absolute shrinkage and selection operator (LASSO) is one of the key steps in high-dimensional cancer classification, as gene coefficient estimation and gene selection simultaneously. However, the LASSO has been criticized for being biased in gene selection. The adaptive LASSO (APLR) was originally proposed to overcome the selection bias by assigning a consistent weight to each gene. In high-dimensional data, however, the adaptive LASSO faces practical problems in choosing the type of initial weight. In practice, the LASSO estimator itself has been used as an initial weight. However, this may not be preferable because the LASSO is inconsistent in itself. To address this issue, an alternative initial weight in adaptive penalized logistic regression (CBPLR) is proposed. The effectiveness of the CBPLR is examined on three well-known high-dimensional cancer classification datasets using number of selected genes, area under the curve, and misclassification rate. The experimental results reveal that the proposed CBPLR is quite efficient and feasible for cancer classification. Additionally, the proposed weight is compared with APLR and LASSO and exhibits competitive performance in both classification accuracy and gene selection. The proposed CBPLR has significant impact in penalized logistic regression by selecting fewer genes with high area under the curve and low misclassification rate. Thus, the proposed weight could conceivably be used in other research that implements gene selection in the field of high dimensional cancer classification.


Computers in Biology and Medicine | 2015

Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification

Zakariya Yahya Algamal; Muhammad Hisyam Lee

Cancer classification and gene selection in high-dimensional data have been popular research topics in genetics and molecular biology. Recently, adaptive regularized logistic regression using the elastic net regularization, which is called the adaptive elastic net, has been successfully applied in high-dimensional cancer classification to tackle both estimating the gene coefficients and performing gene selection simultaneously. The adaptive elastic net originally used elastic net estimates as the initial weight, however, using this weight may not be preferable for certain reasons: First, the elastic net estimator is biased in selecting genes. Second, it does not perform well when the pairwise correlations between variables are not high. Adjusted adaptive regularized logistic regression (AAElastic) is proposed to address these issues and encourage grouping effects simultaneously. The real data results indicate that AAElastic is significantly consistent in selecting genes compared to the other three competitor regularization methods. Additionally, the classification performance of AAElastic is comparable to the adaptive elastic net and better than other regularization methods. Thus, we can conclude that AAElastic is a reliable adaptive regularized logistic regression method in the field of high-dimensional cancer classification.


Journal of Chemometrics | 2015

High‐dimensional QSAR prediction of anticancer potency of imidazo[4,5‐b]pyridine derivatives using adjusted adaptive LASSO

Zakariya Yahya Algamal; Muhammad Hisyam Lee; Abdo Mohammed Al-Fakih; Madzlan Aziz

In high‐dimensional quantitative structure–activity relationship (QSAR) studies, identifying relevant molecular descriptors is a major goal. In this study, a proposed penalized method is used as a tool for molecular descriptors selection. The method, called adjusted adaptive least absolute shrinkage and selection operator (LASSO) (AALASSO), is employed to study the high‐dimensional QSAR prediction of the anticancer potency of a series of imidazo[4,5‐b]pyridine derivatives. This proposed penalized method can perform consistency selection and deal with grouping effects simultaneously. Compared with other commonly used penalized methods, such as LASSO and adaptive LASSO with different initial weights, the results show that AALASSO obtains the best predictive ability not only by consistency selection but also by encouraging grouping effects in selecting more correlated molecular descriptors. Hence, we conclude that AALASSO is a reliable penalized method in the field of high‐dimensional QSAR studies. Copyright


Journal of Chemometrics | 2016

High-dimensional quantitative structure–activity relationship modeling of influenza neuraminidase a/PR/8/34 (H1N1) inhibitors based on a two-stage adaptive penalized rank regression

Zakariya Yahya Algamal; Muhammad Hisyam Lee; Abdo Mohammed Al-Fakih

Outliers in the biological activity variable or the heavy tailed distribution of the error are often encountered in practice. Under these circumstances, the quantittative structure–activity relationship (QSAR) model using multiple linear regression is not efficient. In this paper, a two‐stage adaptive penalized rank regression is proposed for constructing a robust and efficient high‐dimensional QSAR model of influenza virus neuraminidase A/PR/8/34 (H1N1) inhibitors. The results demonstrate the effectiveness of our proposed method in simultaneously estimating a robust QSAR model and selecting informative molecular descriptors. Furthermore, the results prove that the proposed method can significantly encourage the grouping effect. The proposed method, because of the high predictive ability and robustness, could be a useful method in high‐dimensional QSAR modeling. Copyright


Journal of Chemometrics | 2016

Quantitative structure–activity relationship model for prediction study of corrosion inhibition efficiency using two-stage sparse multiple linear regression

Abdo Mohammed Al-Fakih; Zakariya Yahya Algamal; Muhammad Hisyam Lee; Hassan H. Abdallah; Hasmerya Maarof; Madzlan Aziz

A new quantitative structure–activity relationship (QSAR) of the inhibition of mild steel corrosion in 1 M hydrochloric acid using furan derivatives was developed by proposing two‐stage sparse multiple linear regression. The sparse multiple linear regression using ridge penalty and sparse multiple linear regression using elastic net (SMLRE) were used to develop the QSAR model. The results show that the SMLRE‐based model possesses high predictive power compared with sparse multiple linear regression using ridge penalty‐based model according to the mean‐squared errors for both training and test datasets, leave‐one‐out internal validation (Q2int = 0.98), and external validation (Q2ext = 0.95). In addition, the results of applicability domain assessment using the leverage approach reveal a reliable and robust SMLRE‐based model. In conclusion, the developed QSAR model using SMLRE can be efficiently used in the studies of corrosion inhibition efficiency. Copyright


Sar and Qsar in Environmental Research | 2016

High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

Zakariya Yahya Algamal; Muhammad Hisyam Lee; Abdo Mohammed Al-Fakih; Madzlan Aziz

Abstract In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.


Sar and Qsar in Environmental Research | 2017

A new adaptive L1-norm for optimal descriptor selection of high-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives

Zakariya Yahya Algamal; Muhammad Hisyam Lee

Abstract A high-dimensional quantitative structure–activity relationship (QSAR) classification model typically contains a large number of irrelevant and redundant descriptors. In this paper, a new design of descriptor selection for the QSAR classification model estimation method is proposed by adding a new weight inside L1-norm. The experimental results of classifying the anti-hepatitis C virus activity of thiourea derivatives demonstrate that the proposed descriptor selection method in the QSAR classification model performs effectively and competitively compared with other existing penalized methods in terms of classification performance on both the training and the testing datasets. Moreover, it is noteworthy that the results obtained in terms of stability test and applicability domain provide a robust QSAR classification model. It is evident from the results that the developed QSAR classification model could conceivably be employed for further high-dimensional QSAR classification studies.


Journal of Chemometrics | 2017

High-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives based on the sparse logistic regression model with a bridge penalty

Zakariya Yahya Algamal; Muhammad Hisyam Lee; Abdo Mohammed Al-Fakih; Madzlan Aziz

This study addresses the problem of the high‐dimensionality of quantitative structure‐activity relationship (QSAR) classification modeling. A new selection of descriptors that truly affect biological activity and a QSAR classification model estimation method are proposed by combining the sparse logistic regression model with a bridge penalty for classifying the anti‐hepatitis C virus activity of thiourea derivatives. Compared to other commonly used sparse methods, the proposed method shows superior results in terms of classification accuracy and model interpretation.


Sar and Qsar in Environmental Research | 2017

A QSAR classification model for neuraminidase inhibitors of influenza A viruses (H1N1) based on weighted penalized support vector machine

Zakariya Yahya Algamal; M.K. Qasim; H. T. M. Ali

Abstract Descriptor selection is a procedure widely used in chemometrics. The aim is to select the best subset of descriptors relevant to the quantitative structure–activity relationship (QSAR) study being considered. In this paper, a new descriptor selection method for the QSAR classification model is proposed by adding a new weight inside L1-norm. The experimental results from classifying the neuraminidase inhibitors of influenza A viruses (H1N1) demonstrate that the proposed method in the QSAR classification model performs effectively and competitively compared with other existing penalized methods in terms of classification performance and the number of selected descriptors.


Computers in Biology and Medicine | 2018

Gene selection for microarray gene expression classification using Bayesian Lasso quantile regression

Zakariya Yahya Algamal; Rahim Alhamzawi; Haithem Taha Mohammad Ali

Gene selection has been proven to be an effective way to improve the results of many classification methods. However, existing gene selection techniques in binary classification regression are sensitive to outliers of the data, heteroskedasticity or other anomalies of the latent response. In this paper, we propose a new Bayesian hierarchical model to overcome these problems in a relatively straightforward way. In particular, we propose a new Bayesian Lasso method that employs a skewed Laplace distribution for the errors and a scaled mixture of uniform distribution for the regression parameters, together with Bayesian MCMC estimation. Comprehensive comparisons between our proposed gene selection method and other competitor methods are performed experimentally, depending on four benchmark gene expression datasets. The experimental results prove that the proposed method is very effective for selecting the most relevant genes with high classification accuracy.

Collaboration


Dive into the Zakariya Yahya Algamal's collaboration.

Top Co-Authors

Avatar

Muhammad Hisyam Lee

Universiti Teknologi Malaysia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Madzlan Aziz

Universiti Teknologi Malaysia

View shared research outputs
Top Co-Authors

Avatar

Hasmerya Maarof

Universiti Teknologi Malaysia

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Norazah Basar

Universiti Teknologi Malaysia

View shared research outputs
Researchain Logo
Decentralizing Knowledge