Xiaowei Yan

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Xiaowei Yan is active.

Explore More

Publication

Featured researches published by Xiaowei Yan.

Expert Systems With Applications | 2009

Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

We design a genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. In this approach, an elaborate encoding method is developed, and the relative confidence is used as the fitness function. With genetic algorithm, a global search can be performed and system automation is implemented, because our model does not require the user-specified threshold of minimum support. Furthermore, we expand this strategy to cover quantitative association rule discovery. For efficiency, we design a generalized FP-tree to implement this algorithm. We experimentally evaluate our approach, and demonstrate that our algorithms significantly reduce the computation costs and generate interesting association rules only.

Applied Artificial Intelligence | 2005

ARMGA: IDENTIFYING INTERESTING ASSOCIATION RULES WITH GENETIC ALGORITHMS

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

ABSTRACT Priori-like algorithms for association rules mining have relied on two user-specified thresholds: minimum support and minimum confidence. There are two significant challenges to applying these algorithms to real-world applications: database-dependent minimum-support and exponential search space. Database-dependent minimum-support means that users must specify suitable thresholds for their mining tasks though they may have no knowledge concerning their databases. To circumvent these problems, in this paper, we design an evolutionary mining strategy, namely the ARMGA model, based on a genetic algorithm. Like general genetic algorithms, our ARMGA model is effective for global searching, especially when the search space is so large that it is hardly possible to use deterministic searching method.

Applied Artificial Intelligence | 2003

Toward databases mining: pre-processing collected data

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

This paper presents a new means of selecting quality data for mining multiple data sources. Traditional data-mining strategies obtain necessary data from internal and external data sources and pool all the data into a huge homogeneous dataset for discovery. In contrast, our data-mining strategy identifies quality data from (internal and external) data sources for a mining task. A framework is advocated for generating quality data. Experimental results demonstrate that application of this new data collecting technique can not only identify quality data, but can also efficiently reduce the amount of data that must be considered during mining.

Applied Artificial Intelligence | 2007

ON DATA STRUCTURES FOR ASSOCIATION RULE DISCOVERY

Xiaowei Yan; Shichao Zhang; Chengqi Zhang

Systematically we study data structures used to implement the algorithms of association rule mining, including hash tree, itemset tree, and FP-tree (frequent pattern tree). Further, we present a generalized FP-tree in an applied context. This assists in better understanding existing association-rule-mining strategies. In addition, we discuss and analyze experimentally the generalized k-FP-tree, and demonstrate that the generalized FP-tree reduces the computation costs significantly. This study will be useful to many association analysis tasks where one must provide really interesting rules and develop efficient algorithms for identifying association rules.

Applied Artificial Intelligence | 2009

CONFIDENCE METRICS FOR ASSOCIATION RULE MINING

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

We propose a simple, novel, and yet effective confidence metric for measuring the interestingness of association rules. Distinguishing from existing confidence measures, our metrics really indicate the positively companionate correlations between frequent itemsets. Furthermore, some desired properties are derived for examining the goodness of confidence measures in terms of probabilistic significance. We systematically analyze our metrics and traditional ones, and demonstrate that our new algorithm significantly captures the mainstream properties. Our approach will be useful to many association analysis tasks where one must provide actionable association rules and assist users to make quality decisions.

International Journal of Software Engineering and Knowledge Engineering | 2004

IDENTIFYING SOFTWARE COMPONENT ASSOCIATION WITH GENETIC ALGORITHM

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

Identifying software component association is useful for component management and component retrieval. In this paper we design an evolutionary strategy to understand software structure better and identify software component association, by using genetic algorithm. Our mining strategy is effective for global search, especially when the searched space is so large that it is hardly possible to use deterministic search method.

international conference on information technology coding and computing | 2003

Identifying frequent terms in text databases by association semantics

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

Existing information retrieval methods are mainly based on either term similarity or latent semantics. To reduce irrelevant information searched, this paper presents a new approach for information retrieval by applying the methodology of association rule mining to a text database. Association semantics among terms of a document and a query are considered, such that the semantic similarity between the document and query may be reduced if they are somewhat irrelevant.

IEEE Computational Intelligence Bulletin | 2005

Association-based Segmentation for Chinese-crossed Query Expansion

Chengqi Zhang; Zhenxing Qin; Xiaowei Yan

software engineering and knowledge engineering | 2003

A Database-Independent Strategy for Confidence Determination.

Xiaowei Yan; Chengqi Zhang; Shichao Zhang

Archive | 2003

Indexing by conditional association semantics

Xiaowei Yan; Chengqi Zhang; Shichao Zhang; Zhenxing Qin

Explore More

Collaboration

Dive into the Xiaowei Yan's collaboration.

Top Co-Authors

Shichao Zhang

Guangxi Normal University

View shared research outputs

Top Co-Authors

Li Niu

Guangxi Normal University

View shared research outputs

Top Co-Authors

Shichao Zhang

Guangxi Normal University

View shared research outputs

Explore More

University of Technology, Sydney

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot

Dive into the research topics where Xiaowei Yan is active.

Publication

Featured researches published by Xiaowei Yan.

Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support

ARMGA: IDENTIFYING INTERESTING ASSOCIATION RULES WITH GENETIC ALGORITHMS

Toward databases mining: pre-processing collected data

ON DATA STRUCTURES FOR ASSOCIATION RULE DISCOVERY

CONFIDENCE METRICS FOR ASSOCIATION RULE MINING

IDENTIFYING SOFTWARE COMPONENT ASSOCIATION WITH GENETIC ALGORITHM

Identifying frequent terms in text databases by association semantics

Association-based Segmentation for Chinese-crossed Query Expansion

A Database-Independent Strategy for Confidence Determination.

Indexing by conditional association semantics

Collaboration

Dive into the Xiaowei Yan's collaboration.