Olufemi A. Omitaomu | Researchain

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Olufemi A. Omitaomu is active.

Explore More

Publication

Featured researches published by Olufemi A. Omitaomu.

intelligent data analysis | 2009

Knowledge discovery from data streams

João Gama; Auroop R. Ganguly; Olufemi A. Omitaomu; Ranga Raju Vatsavai; Mohamed Medhat Gaber

Since the beginning of the Internet age and the increased use of ubiquitous computing devices, the large volume and continuous flow of distributed data have imposed new constraints on the design of learning algorithms. Exploring how to extract knowledge structures from evolving and time-changing data, Knowledge Discovery from Data Streams presents a coherent overview of state-of-the-art research in learning from data streams. The book covers the fundamentals that are imperative to understanding data streams and describes important applications, such as TCP/IP traffic, GPS data, sensor networks, and customer click streams. It also addresses several challenges of data mining in the future, when stream mining will be at the core of many applications. These challenges involve designing useful and efficient data mining solutions applicable to real-world problems. In the appendix, the author includes examples of publicly available software and online data sets. This practical, up-to-date book focuses on the new requirements of the next generation of data mining. Although the concepts presented in the text are mainly about data streams, they also are valid for different areas of machine learning and data mining.

Pattern Recognition | 2011

Weighted dynamic time warping for time series classification

Young-Seon Jeong; Myong K. Jeong; Olufemi A. Omitaomu

Dynamic time warping (DTW), which finds the minimum path by providing non-linear alignments between two time series, has been widely used as a distance measure for time series classification and clustering. However, DTW does not account for the relative importance regarding the phase difference between a reference point and a testing point. This may lead to misclassification especially in applications where the shape similarity between two sequences is a major consideration for an accurate recognition. Therefore, we propose a novel distance measure, called a weighted DTW (WDTW), which is a penalty-based DTW. Our approach penalizes points with higher phase difference between a reference point and a testing point in order to prevent minimum distance distortion caused by outliers. The rationale underlying the proposed distance measure is demonstrated with some illustrative examples. A new weight function, called the modified logistic weight function (MLWF), is also proposed to systematically assign weights as a function of the phase difference between a reference point and a testing point. By applying different weights to adjacent points, the proposed algorithm can enhance the detection of similarity between two time series. We show that some popular distance measures such as DTW and Euclidean distance are special cases of our proposed WDTW measure. We extend the proposed idea to other variants of DTW such as derivative dynamic time warping (DDTW) and propose the weighted version of DDTW. We have compared the performances of our proposed procedures with other popular approaches using public data sets available through the UCR Time Series Data Mining Archive for both time series classification and clustering problems. The experimental results indicate that the proposed approaches can achieve improved accuracy for time series classification and clustering problems.

Archive | 2008

Knowledge Discovery from Sensor Data

Auroop R. Ganguly; João Gama; Olufemi A. Omitaomu; Mohamed Medhat Gaber; Ranga Raju Vatsavai

Addressing the issues challenging the sensor community, this book presents innovative solutions in offline data mining and real-time analysis of sensor or geographically distributed data. Illustrated with case studies, it discusses the challenges and requirements for sensor data-based knowledge discovery solutions in high-priority application. The book then explores the fusion between heterogeneous data streams from multiple sensor types and applications in science, engineering, and security. Bringing together researchers from academia, government, and the private sector, this book delineates the application of knowledge modeling in data intensive operations. Multi/Card Deck Copy

IEEE Sensors Journal | 2011

Empirical Mode Decomposition Technique With Conditional Mutual Information for Denoising Operational Sensor Data

Olufemi A. Omitaomu; Vladimir Protopopescu; Auroop R. Ganguly

This paper presents a new approach for denoising sensor signals using the Empirical Mode Decomposition (EMD) technique and the Information-theoretic method. The EMD technique is applied to decompose a noisy sensor signal into the so-called intrinsic mode functions (IMFs). These functions are of the same length and in the same time domain as the original signal. Therefore, the EMD technique preserves varying frequency in time. Assuming the given signal is corrupted by high-frequency (HF) Gaussian noise implies that most of the noise should be captured by the first few modes. Therefore, our proposition is to separate the modes into HF and low-frequency (LF) groups. We applied an information-theoretic method, namely, mutual information to determine the cutoff for separating the modes. A denoising procedure is applied only to the HF group using a shrinkage approach. Upon denoising, this group is combined with the original LF group to obtain the overall denoised signal. We illustrate our approach with simulated and real-world cargo radiation data sets. The results are compared to two popular denoising techniques in the literature, namely discrete Fourier transform (DFT) and discrete wavelet transform (DWT). We found that our approach performs better than DWT and DFT in most cases, and comparatively to DWT in some cases in terms of: 1) mean square error; 2) recomputed signal-to-noise ratio; and 3) visual quality of the denoised signals.

The Engineering Economist | 2007

Fuzzy Present Value Analysis Model For Evaluating Information System Projects

Olufemi A. Omitaomu; Adedeji Badiru

In this article, the economic evaluation of information system projects using present value is analyzed based on triangular fuzzy numbers. Information system projects usually have numerous uncertainties and several conditions of risk that make their economic evaluation a challenging task. Each year, several information system projects are cancelled before completion as a result of budget overruns at a cost of several billions of dollars to industry. Although engineering economic analysis offers tools and techniques for evaluating risky projects, the tools are not enough to place information system projects on a safe budget/selection track. There is a need for an integrative economic analysis model that will account for the uncertainties in estimating project costs, benefits, and useful lives of uncertain and risky projects. In this study, we propose an approximate method of computing project present value using the concept of fuzzy modeling with special reference to information system projects. This proposed model has the potential of enhancing the project selection process by capturing a better economic picture of the project alternatives. The proposed methodology can also be used for other real-life projects with high degree of uncertainty and risk.

systems man and cybernetics | 2007

Online Support Vector Regression Approach for the Monitoring of Motor Shaft Misalignment and Feedwater Flow Rate

Olufemi A. Omitaomu; Jeong Myong; Adedeji Badiru; J. W. Hines

Timely and accurate information about incipient faults in online machines will greatly enhance the development of optimal maintenance procedures. The application of support vector regression to machine health monitoring was recently investigated; however, such implementation is based on batch processing of the available data. Therefore, the addition of new sample to the already existing dataset requires that the technique should retrain from scratch. This disadvantage makes the technique unsuitable for online systems that will give real-time information to field engineers so that corrective actions could be taken before there is any damage to the system. This paper presents an application of accurate online support vector regression (AOSVR) approach that efficiently updates a trained predictor whenever a new sample is added to the training set using shaft misalignment and nuclear power plant feedwater flow rate data. The results show that the approach is effective for online machine condition monitoring where it is usually difficult to obtain sufficient training data prior to the installation of the online systems.

systems man and cybernetics | 2011

Online Support Vector Regression With Varying Parameters for Time-Dependent Data

Olufemi A. Omitaomu; Myong K. Jeong; Adedeji Badiru

Support vector regression (SVR) is a machine learning technique that continues to receive interest in several domains, including manufacturing, engineering, and medicine. In order to extend its application to problems in which data sets arrive constantly and in which batch processing of the data sets is infeasible or expensive, an accurate online SVR (AOSVR) technique was proposed. The AOSVR technique efficiently updates a trained SVR function whenever a sample is added to or removed from the training set without retraining the entire training data. However, the AOSVR technique assumes that the new samples and the training samples are of the same characteristics; hence, the same value of SVR parameters is used for training and prediction. This assumption is not applicable to data samples that are inherently noisy and nonstationary, such as sensor data. As a result, we propose AOSVR with varying parameters that uses varying SVR parameters rather than fixed SVR parameters and hence accounts for the variability that may exist in the samples. To accomplish this objective, we also propose a generalized weight function to automatically update the weights of SVR parameters in online monitoring applications. The proposed function allows for lower and upper bounds for SVR parameters. We tested our proposed approach and compared results with the conventional AOSVR approach using two benchmark time-series data and sensor data from a nuclear power plant. The results show that using varying SVR parameters is more applicable to time-dependent data.

IEEE Transactions on Intelligent Transportation Systems | 2009

Anomaly Detection in Radiation Sensor Data With Application to Transportation Security

Olufemi A. Omitaomu; Auroop R. Ganguly; Bruce W. Patton; Vladimir Protopopescu

In this paper, we present a new approach for detecting trucks transporting illicit radioactive materials using radiation data. The approach is motivated by the high number of false alarms that typically results when using radiation portal monitors. Our approach is a three-stage anomaly detection process that consists of transforming the radiation sensor data into wavelet coefficients, representing the transformed data in binary form, and detecting anomalies among data sets using a proximity-based method. The approach is evaluated using simulated radiation data, and the results are encouraging. From a transportation security perspective, our results indicate that the concomitant use of gross count and spectroscopy radiation data improves identification of trucks transporting illicit radioactive materials. The results also suggest that the use of additional heterogeneous data with radiation data may enhance the reliability of the detection process. Further testing with real radiation data and mixture of cargo is needed to fully validate the results.

Computers & Industrial Engineering | 2008

Data mining approaches for modeling complex electronic circuit design activities

Yongjin Kwon; Olufemi A. Omitaomu; Gi-Nam Wang

A printed circuit board (PCB) is an essential part of modern electronic circuits. It is made of a flat panel of insulating materials with patterned copper foils that act as electric pathways for various components such as ICs, diodes, capacitors, resistors, and coils. The size of PCBs has been shrinking over the years, while the number of components mounted on these boards has increased considerably. This trend makes the design and fabrication of PCBs ever more difficult. At the beginning of design cycles, it is important to estimate the time to complete the steps required accurately, based on many factors such as the required parts, approximate board size and shape, and a rough sketch of schematics. Current approach uses multiple linear regression (MLR) technique for time and cost estimations. However, the need for accurate predictive models continues to grow as the technology becomes more advanced. In this paper, we analyze a large volume of historical PCB design data, extract some important variables, and develop predictive models based on the extracted variables using a data mining approach. The data mining approach uses an adaptive support vector regression (ASVR) technique; the benchmark model used is the MLR technique currently being used in the industry. The strengths of SVR for this data include its ability to represent data in high-dimensional space through kernel functions. The computational results show that a data mining approach is a better prediction technique for this data. Our approach reduces computation time and enhances the practical applications of the SVR technique.

Journal of Agricultural and Applied Economics | 2007

The Impact of an Urban Growth Boundary on Land Development in Knox County, Tennessee: A Comparison of Two-Stage Probit Least Squares and Multilayer Neural Network Models

Seong-Hoon Cho; Olufemi A. Omitaomu; Neelam C. Poudyal; David B. Eastwood

The impact of an urban growth boundary (UGB) on land development in Knox County, TN is estimated via two-stage probit and neural-network models. The insignificance of UGB variable in the two-stage probit model and more visible development patterns in the western part of Knoxville and the neighboring town of Farragut during the post-UGB period in both models suggest that the UGB has not curtailed urban sprawl. Although the network model is found to be a viable alternative to more conventional discrete choice approach for improving the predictability of land development, it is at the cost of evaluating marginal effects.

Explore More