Zexuan Zhu | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Zexuan Zhu is active.

Explore More

Publication

Featured researches published by Zexuan Zhu.

systems man and cybernetics | 2007

Wrapper–Filter Feature Selection Algorithm Using a Memetic Framework

Zexuan Zhu; Yew-Soon Ong; Manoranjan Dash

This correspondence presents a novel hybrid wrapper and filter feature selection algorithm for a classification problem using a memetic framework. It incorporates a filter ranking method in the traditional genetic algorithm to improve classification performance and accelerate the search in identifying the core feature subsets. Particularly, the method adds or deletes a feature from a candidate feature subset based on the univariate feature ranking information. This empirical study on commonly used data sets from the University of California, Irvine repository and microarray data sets shows that the proposed method outperforms existing methods in terms of classification accuracy, number of selected features, and computational efficiency. Furthermore, we investigate several major issues of memetic algorithm (MA) to identify a good balance between local search and genetic search so as to maximize search quality and efficiency in the hybrid filter and wrapper MA

Neurocomputing | 2008

A fast pruned-extreme learning machine for classification problem

Hai-Jun Rong; Yew-Soon Ong; Ah-Hwee Tan; Zexuan Zhu

Extreme learning machine (ELM) represents one of the recent successful approaches in machine learning, particularly for performing pattern classification. One key strength of ELM is the significantly low computational time required for training new classifiers since the weights of the hidden and output nodes are randomly chosen and analytically determined, respectively. In this paper, we address the architectural design of the ELM classifier network, since too few/many hidden nodes employed would lead to underfitting/overfitting issues in pattern classification. In particular, we describe the proposed pruned-ELM (P-ELM) algorithm as a systematic and automated approach for designing ELM classifier network. P-ELM uses statistical methods to measure the relevance of hidden nodes. Beginning from an initial large number of hidden nodes, irrelevant nodes are then pruned by considering their relevance to the class labels. As a result, the architectural design of ELM network classifier can be automated. Empirical study of P-ELM on several commonly used classification benchmark problems and with diverse forms of hidden node functions show that the proposed approach leads to compact network classifiers that generate fast response and robust prediction accuracy on unseen data, comparing with traditional ELM and other popular machine learning approaches.

Pattern Recognition | 2007

Markov blanket-embedded genetic algorithm for gene selection

Zexuan Zhu; Yew-Soon Ong; Manoranjan Dash

Microarray technologies enable quantitative simultaneous monitoring of expression levels for thousands of genes under various experimental conditions. This new technology has provided a new way of biological classification on a genome-wide scale. However, predictive accuracy is affected by the presence of thousands of genes many of which are unnecessary from the classification point of view. So, a key issue of microarray data classification is to identify the smallest possible set of genes that can achieve good predictive accuracy. In this study, we propose a novel Markov blanket-embedded genetic algorithm (MBEGA) for gene selection problem. In particular, the embedded Markov blanket-based memetic operators add or delete features (or genes) from a genetic algorithm (GA) solution so as to quickly improve the solution and fine-tune the search. Empirical results on synthetic and microarray benchmark datasets suggest that MBEGA is effective and efficient in eliminating irrelevant and redundant features based on both Markov blanket and predictive power in classifier model. A detailed comparative study with other methods from each of filter, wrapper, and standard GA shows that MBEGA gives a best compromise among all four evaluation criteria, i.e., classification accuracy, number of selected genes, computational cost, and robustness.

IEEE Transactions on Evolutionary Computation | 2011

DNA Sequence Compression Using Adaptive Particle Swarm Optimization-Based Memetic Algorithm

Zexuan Zhu; Jiarui Zhou; Zhen Ji; Yuhui Shi

With the rapid development of high-throughput DNA sequencing technologies, the amount of DNA sequence data is accumulating exponentially. The huge influx of data creates new challenges for storage and transmission. This paper proposes a novel adaptive particle swarm optimization-based memetic algorithm (POMA) for DNA sequence compression. POMA is a synergy of comprehensive learning particle swarm optimization (CLPSO) and an adaptive intelligent single particle optimizer (AdpISPO)-based local search. It takes advantage of both CLPSO and AdpISPO to optimize the design of approximate repeat vector (ARV) codebook for DNA sequence compression. ARV is first introduced in this paper to represent the repeated fragments across multiple sequences in direct, mirror, pairing, and inverted patterns. In POMA, candidate ARV codebooks are encoded as particles and the optimal solution, which covers the most approximate repeated fragments with the fewest base variations, is identified through the exploration and exploitation of POMA. In each iteration of POMA, the leader particles in the swarm are selected based on weighted fitness values and each leader particle is fine-tuned with an AdpISPO-based local search, so that the convergence of the search in local region is accelerated. A detailed comparison study between POMA and the counterpart algorithms is performed on 29 (23 basic and 6 composite) benchmark functions and 11 real DNA sequences. POMA is observed to obtain better or competitive performance with a limited number of function evaluations. POMA also attains lower bits-per-base than other state-of-the-art DNA-specific algorithms on DNA sequence data. The experimental results suggest that the cooperation of CLPSO and AdpISPO in the framework of memetic algorithm is capable of searching the ARV codebook space efficiently.

PLOS Computational Biology | 2017

PBMDA: A novel and effective path-based computational model for miRNA-disease association prediction

Zhu-Hong You; Zhi-An Huang; Zexuan Zhu; Guiying Yan; Zheng-Wei Li; Zhenkun Wen; Xing Chen

In the recent few years, an increasing number of studies have shown that microRNAs (miRNAs) play critical roles in many fundamental and important biological processes. As one of pathogenetic factors, the molecular mechanisms underlying human complex diseases still have not been completely understood from the perspective of miRNA. Predicting potential miRNA-disease associations makes important contributions to understanding the pathogenesis of diseases, developing new drugs, and formulating individualized diagnosis and treatment for diverse human complex diseases. Instead of only depending on expensive and time-consuming biological experiments, computational prediction models are effective by predicting potential miRNA-disease associations, prioritizing candidate miRNAs for the investigated diseases, and selecting those miRNAs with higher association probabilities for further experimental validation. In this study, Path-Based MiRNA-Disease Association (PBMDA) prediction model was proposed by integrating known human miRNA-disease associations, miRNA functional similarity, disease semantic similarity, and Gaussian interaction profile kernel similarity for miRNAs and diseases. This model constructed a heterogeneous graph consisting of three interlinked sub-graphs and further adopted depth-first search algorithm to infer potential miRNA-disease associations. As a result, PBMDA achieved reliable performance in the frameworks of both local and global LOOCV (AUCs of 0.8341 and 0.9169, respectively) and 5-fold cross validation (average AUC of 0.9172). In the cases studies of three important human diseases, 88% (Esophageal Neoplasms), 88% (Kidney Neoplasms) and 90% (Colon Neoplasms) of top-50 predicted miRNAs have been manually confirmed by previous experimental reports from literatures. Through the comparison performance between PBMDA and other previous models in case studies, the reliable performance also demonstrates that PBMDA could serve as a powerful computational tool to accelerate the identification of disease-miRNA associations.

Integrated Computer-aided Engineering | 2015

Global path planning of wheeled robots using multi-objective memetic algorithms

Zexuan Zhu; Jun Xiao; Jian-Qiang Li; Fangxiao Wang; Qingfu Zhang

Global path planning is a fundamental problem of mobile robotics. The majority of global path planning methods are designed to find a collision-free path from a start location to a target location while optimizing one or more objectives like path length, smoothness, and safety at a time. It is noted that providing multiple tradeoff path solutions of different objectives is much more beneficial to the user’s choice than giving a single optimal solution in terms of some specific criterion. This paper proposes a global path planning of wheeled robots using multi-objective memetic algorithms (MOMAs). Particularly, two MOMAs are implemented based on conventional multi-objective genetic algorithms with elitist non-dominated sorting and decomposition strategies respectively to optimize the path length and smoothness simultaneously. Novel path encoding scheme, path refinement, and specific evolutionary operators are designed and introduced to the MOMAs to enhance the search ability of the algorithms as well as guarantee the safety of the candidate paths obtained in complex environments. Experimental results on both simulated and real environments show that the proposed MOMAs are efficient in planning a set of valid tradeoff paths in complex environments.

Briefings in Bioinformatics | 2015

High-throughput DNA sequence data compression

Zexuan Zhu; Yongpeng Zhang; Zhen Ji; Shan He; Xiao Yang

The exponential growth of high-throughput DNA sequence data has posed great challenges to genomic data storage, retrieval and transmission. Compression is a critical tool to address these challenges, where many methods have been developed to reduce the storage size of the genomes and sequencing data (reads, quality scores and metadata). However, genomic data are being generated faster than they could be meaningfully analyzed, leaving a large scope for developing novel compression algorithms that could directly facilitate data analysis beyond data transfer and storage. In this article, we categorize and provide a comprehensive review of the existing compression methods specialized for genomic data and present experimental results on compression ratio, memory usage, time for compression and decompression. We further present the remaining challenges and potential directions for future research.

Applied Physics Letters | 2007

A direct first principles study on the structure and electronic properties of BexZn1−xO

Xiaofeng Fan; Zexuan Zhu; Yew-Soon Ong; Y. M. Lu; Zexiang Shen; Jer-Lai Kuo

We present a systematic study on the structural and electronic properties of all alloy configurations of BexZn1−xO in a unit cell with 16 cations using density functional theory (DFT) methods. The 216 complexity is reduced by considering the symmetry of the parent structures. The experimental structures and electronic properties of the bulk material are reasonably reproduced by the DFT methods. The lattice constants of the alloy are found to follow Vegard’s law [Z. Phys. 5, 17 (1921)] and are comparable with the experimental values. Examining the formation enthalpy of all alloy configurations suggests the possible existence of three metastable order states. The calculated band gap of the BexZn1−xO is also compared with the experimental measurements and the authors found that some alloy configurations with the same concentration can have band gaps differed by ∼1.5eV.

Information Sciences | 2015

Three-dimensional Gabor feature extraction for hyperspectral imagery classification using a memetic framework

Zexuan Zhu; Sen Jia; Shan He; Yiwen Sun; Zhen Ji; Linlin Shen

Feature extraction based on three-dimensional (3D) wavelet transform is capable of improving the classification accuracy of hyperspectral imagery data by simultaneously capturing the geometrical and statistical spectral-spatial structure of the data. Nevertheless, the design of wavelets is always proceeded with empirical parameters, which tends to involve a large number of irrelevant and redundant spectral-spatial features and results in suboptimal configuration. This paper proposes a 3D Gabor wavelet feature extraction in a memetic framework, named M3DGFE, for hyperspectral imagery classification. Particularly, the parameter setting of 3D Gabor wavelet feature extraction is optimized using memetic algorithm so that discriminative and parsimonious feature set is acquired for accurate classification. M3DGFE is characterized by an efficient fitness evaluation function and a pruning local search. In the fitness evaluation function, a new concept of redundancy-free relevance based on conditional mutual information is proposed to measure the goodness of the extracted candidate features. The pruning local search is specially designed to eliminate both irrelevant and redundant features without sacrificing the discriminability of the obtained feature subset. M3DGFE is tested on both pixel-level and image-level classification using real-world hyperspectral remote sensing data and hyperspectral face data, respectively. The experimental results show that M3DGFE achieves promising classification accuracy with parsimonious feature subset.

IEEE Geoscience and Remote Sensing Letters | 2013

Discriminative Gabor Feature Selection for Hyperspectral Image Classification

Linlin Shen; Zexuan Zhu; Sen Jia; Jiasong Zhu; Yiwen Sun

Three-dimensional Gabor wavelets have recently been successfully applied for hyperspectral image classification due to their ability to extract joint spatial and spectrum information. However, the dimension of the extracted Gabor feature is incredibly huge. In this letter, we propose a symmetrical-uncertainty-based and Markov-blanket-based approach to select informative and nonredundant Gabor features for hyperspectral image classification. The extracted Gabor features with large dimension are first ranked by their information contained for classification and then added one by one after investigating the redundancy with already selected features. The proposed approach was fully tested on the widely used Indian Pine site data. The results show that the selected features are much more efficient and can achieve similar performance with previous approach using only hundreds of features.

Explore More