Jason Lines
University of East Anglia
Network
Latest external collaboration on country level. Dive into details by clicking on the dots.
Publication
Featured researches published by Jason Lines.
Data Mining and Knowledge Discovery | 2014
Jon Hills; Jason Lines; Edgaras Baranauskas; James Mapp; Anthony J. Bagnall
Time-series classification (TSC) problems present a specific challenge for classification algorithms: how to measure similarity between series. A shapelet is a time-series subsequence that allows for TSC based on local, phase-independent similarity in shape. Shapelet-based classification uses the similarity between a shapelet and a series as a discriminatory feature. One benefit of the shapelet approach is that shapelets are comprehensible, and can offer insight into the problem domain. The original shapelet-based classifier embeds the shapelet-discovery algorithm in a decision tree, and uses information gain to assess the quality of candidates, finding a new shapelet at each node of the tree through an enumerative search. Subsequent research has focused mainly on techniques to speed up the search. We examine how best to use the shapelet primitive to construct classifiers. We propose a single-scan shapelet algorithm that finds the best
knowledge discovery and data mining | 2012
Jason Lines; Luke M. Davis; Jon Hills; Anthony J. Bagnall
Data Mining and Knowledge Discovery | 2015
Jason Lines; Anthony J. Bagnall
k
IEEE Transactions on Knowledge and Data Engineering | 2015
Anthony J. Bagnall; Jason Lines; Jon Hills; Aaron Bostrom
intelligent data engineering and automated learning | 2011
Jason Lines; Anthony J. Bagnall; Patrick Caiger-Smith; Simon Bryan Patrick Anderson
k shapelets, which are used to produce a transformed dataset, where each of the
intelligent data engineering and automated learning | 2012
Jason Lines; Anthony J. Bagnall
International Journal of Neural Systems | 2012
Luke M. Davis; Barry-John Theobald; Jason Lines; Andoni P. Toms; Anthony J. Bagnall
k
international conference on data engineering | 2016
Anthony J. Bagnall; Jason Lines; Jon Hills; Aaron Bostrom
international conference on data mining | 2016
Jason Lines; Sarah Taylor; Anthony J. Bagnall
k features represent the distance between a time series and a shapelet. The primary advantages over the embedded approach are that the transformed data can be used in conjunction with any classifier, and that there is no recursive search for shapelets. We demonstrate that the transformed data, in conjunction with more complex classifiers, gives greater accuracy than the embedded shapelet tree. We also evaluate three similarity measures that produce equivalent results to information gain in less time. Finally, we show that by conducting post-transform clustering of shapelets, we can enhance the interpretability of the transformed data. We conduct our experiments on 29 datasets: 17 from the UCR repository, and 12 we provide ourselves.
ACM Transactions on Knowledge Discovery From Data | 2018
Jason Lines; Sarah Taylor; Anthony J. Bagnall
The problem of time series classification (TSC), where we consider any real-valued ordered data a time series, presents a specific machine learning challenge as the ordering of variables is often crucial in finding the best discriminating features. One of the most promising recent approaches is to find shapelets within a data set. A shapelet is a time series subsequence that is identified as being representative of class membership. The original research in this field embedded the procedure of finding shapelets within a decision tree. We propose disconnecting the process of finding shapelets from the classification algorithm by proposing a shapelet transformation. We describe a means of extracting the k best shapelets from a data set in a single pass, and then use these shapelets to transform data by calculating the distances from a series to each shapelet. We demonstrate that transformation into this new data space can improve classification accuracy, whilst retaining the explanatory power provided by shapelets.