Zuraini Ali Shah
Universiti Teknologi Malaysia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Zuraini Ali Shah.
Computers in Biology and Medicine | 2010
Rosfuzah Roslan; Razib M. Othman; Zuraini Ali Shah; Shahreen Kasim; Hishammuddin Asmuni; Jumail Taliba; Rohayanti Hassan; Zalmiyah Zakaria
Protein-protein interactions (PPIs) play a significant role in many crucial cellular operations such as metabolism, signaling and regulations. The computational methods for predicting PPIs have shown tremendous growth in recent years, but problem such as huge false positive rates has contributed to the lack of solid PPI information. We aimed at enhancing the overlap between computational predictions and experimental results in an effort to partially remove PPIs falsely predicted. The use of protein function predictor named PFP() that are based on shared interacting domain patterns is introduced in this study with the purpose of aiding the Gene Ontology Annotations (GOA). We used GOA and PFP() as agents in a filtering process to reduce false positive pairs in the computationally predicted PPI datasets. The functions predicted by PFP() were extracted from cross-species PPI data in order to assign novel functional annotations for the uncharacterized proteins and also as additional functions for those that are already characterized by the GO (Gene Ontology). The implementation of PFP() managed to increase the chances of finding matching function annotation for the first rule in the filtration process as much as 20%. To assess the capability of the proposed framework in filtering false PPIs, we applied it on the available S. cerevisiae PPIs and measured the performance in two aspects, the improvement made indicated as Signal-to-Noise Ratio (SNR) and the strength of improvement, respectively. The proposed filtering framework significantly achieved better performance than without it in both metrics.
BioMed Research International | 2014
Ching S iang Tan; Wai S oon Ting; Mohd Saberi Mohamad; Weng H owe Chan; Safaai Deris; Zuraini Ali Shah
When gene expression data are too large to be processed, they are transformed into a reduced representation set of genes. Transforming large-scale gene expression data into a set of genes is called feature extraction. If the genes extracted are carefully chosen, this gene set can extract the relevant information from the large-scale gene expression data, allowing further analysis by using this reduced representation instead of the full size data. In this paper, we review numerous software applications that can be used for feature extraction. The software reviewed is mainly for Principal Component Analysis (PCA), Independent Component Analysis (ICA), Partial Least Squares (PLS), and Local Linear Embedding (LLE). A summary and sources of the software are provided in the last section for each feature extraction method.
Information Sciences | 2010
Rosfuzah Roslan; Razib M. Othman; Zuraini Ali Shah; Shahreen Kasim; Hishammuddin Asmuni; Jumail Taliba; Rohayanti Hassan; Zalmiyah Zakaria
Protein-protein interaction (PPI) networks play an outstanding role in the organization of life. Parallel to the growth of experimental techniques on determining PPIs, the emergence of computational methods has greatly accelerated the time needed for the identification of PPIs on a wide genomic scale. Although experimental approaches have limitations that can be complemented by the computational methods, the results from computational methods still suffer from high false positive rates which contribute to the lack of solid PPI information. Our study introduces the PPI-Filter; a computational framework aimed at improving PPI prediction results. It is a post-prediction process which involves filtration, using information based on three different genomic features; (i) gene ontology annotation (GOA), (ii) homologous interactions and (iii) protein families (PFAM) domain interactions. In the study, we incorporated a protein function prediction method, based on interacting domain patterns, the protein function predictor or PFP (), for the purpose of aiding the GOA. The goal is to improve the robustness of predicted PPI pairs by removing the false positive pairs and sustaining as much true positive pairs as possible, thus achieving a high confidence level of PPI datasets. The PPI-Filter has been proven to be applicable based on the satisfactory results obtained using signal-to-noise ratio (SNR) and strength measurements that were applied on different computational PPI prediction methods.
Computers in Biology and Medicine | 2009
Hassan U. Kalsum; Zuraini Ali Shah; Razib M. Othman; Rohayanti Hassan; Shafry M. Rahim; Hishammuddin Asmuni; Jumail Taliba; Zalmiyah Zakaria
Protein domains contain information about the prediction of protein structure, function, evolution and design since the protein sequence may contain several domains with different or the same copies of the protein domain. In this study, we proposed an algorithm named SplitSSI-SVM that works with the following steps. First, the training and testing datasets are generated to test the SplitSSI-SVM. Second, the protein sequence is split into subsequence based on order and disorder regions. The protein sequence that is more than 600 residues is split into subsequences to investigate the effectiveness of the protein domain prediction based on subsequence. Third, multiple sequence alignment is performed to predict the secondary structure using bidirectional recurrent neural networks (BRNN) where BRNN considers the interaction between amino acids. The information of about protein secondary structure is used to increase the protein domain boundaries signal. Lastly, support vector machines (SVM) are used to classify the protein domain into single-domain, two-domain and multiple-domain. The SplitSSI-SVM is developed to reduce misleading signal, lower protein domain signal caused by primary structure of protein sequence and to provide accurate classification of the protein domain. The performance of SplitSSI-SVM is evaluated using sensitivity and specificity on single-domain, two-domain and multiple-domain. The evaluation shows that the SplitSSI-SVM achieved better results compared with other protein domain predictors such as DOMpro, GlobPlot, Dompred-DPS, Mateo, Biozon, Armadillo, KemaDom, SBASE, HMMPfam and HMMSMART especially in two-domain and multiple-domain.
Current Bioinformatics | 2017
Swee Kuan Loh; Swee Thing Low; Lian En Chai; Weng Howe Chan; Mohd Saberi Mohamad; Safaai Deris; Zuwairie Ibrahim; Shahreen Kasim; Zuraini Ali Shah; Hamimah Mohd Jamil; Z. Zakariaa; Suhaimi Napis
Recently, novel high-throughput biotechnologies have provided rich data about different genomes. However, manual annotation of gene function is time consuming. It is also very expensive and infeasible for the growing amounts of data. At present there are numerous functions in certain species that remain unknown or only partially known. Hence, the use of computational approaches to predicting gene function is becoming widespread. Computational approaches are time saving and less costly. Prediction analysis provided can be used in hypotheses to drive the biological validation of gene function. Objective: This paper reviews computational approaches such as the support vector machine, clustering, hierarchical ensemble and network-based approaches. Methods: Comparisons between these approaches are also made in the discussion portion. Results: In addition, the advantages and disadvantages of these computational approaches are discussed. Conclusion: With the emergence of omics data, the focus should be continued on integrating newly added data for gene functions prediction field.
data mining in bioinformatics | 2015
Rohayanti Hassan; Razib M. Othman; Zuraini Ali Shah
To date, classification of structural class using local protein structure rather than the whole structure has been gaining widespread attention. It is noted that the structural class lies in local composition or arrangement of secondary structure, while the threshold-based classification method has restricted rules in determining these structural classes. As a consequence, some of the structures are unknown. In order to determine these unknown structural classes, we propose a fusion algorithm, abbreviated as GSVM-SigLpsSCPred (Granular Support Vector Machine--with Significant Local protein structure for Structural Class Prediction), which consists of two major components, which are: optimal local protein structure to represent the feature vector and granular support vector machine to predict the unknown structural classes. The results highlight the performance of GSVM-SigLpsSCPred as an alternative computational method for low-identity sequences.
2014 8th. Malaysian Software Engineering Conference (MySEC) | 2014
Hidayah Zakaria; Rohayanti Hassan; Razib M. Othman; Hishamudin Asmuni; Zuraini Ali Shah
International journal of engineering and technology | 2018
Dhabitah Lazim; Zuraini Ali Shah; Rd Rohmat Saedudin; Shahreen Kasim; Ayu Alyani Azadin; Rahmat Hidayat; Ansari Saleh Ahmar; Ikhwan Arief
International journal of engineering and technology | 2018
Fatin Azura Ahmad Fauzy; Zuraini Ali Shah; Rd Rohmat Saedudin; Shahreen Kasim; Ayu Alyani Azadin; Ansari Saleh Ahmar; Rahmat Hidayat
International Journal on Advanced Science, Engineering and Information Technology | 2018
Shahreen Kasim; Sherylaidah Samsuddin; Zuraini Ali Shah; Rodziah Atan