A new method based on ensemble time series for fast and accurate clustering
Data Technologies and Applications
ISSN: 2514-9288
Article publication date: 16 March 2023
Issue publication date: 15 November 2023
Abstract
Purpose
Time series are commonly clustered either with purpose-built distance measures or with standard clustering algorithms. Ensemble clustering is a widely used data mining technique for increasing clustering accuracy. This study develops a multistep approach for whole time series clustering based on segmentation, selection of the best segments, and ensemble clustering of the selected segments.
Design/methodology/approach
First, the approach divides the time series dataset into equal segments. Next, the best segments are selected using one or more internal clustering criteria, and the selected segments are then combined for the final clustering. Building on a loop and on the way the best segments are selected for the final clustering (with one criterion or with several criteria simultaneously), two algorithms have been developed in different settings. A logarithmic relationship limits the number of segments created in the loop.
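The three steps described above (segment, select, combine) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the base clusterer, the internal criterion, or the consensus function, so the sketch assumes plain k-means, the mean silhouette width, and a co-association rule (two series are linked when they share a cluster in more than half of the selected segments). The function names and the parameters `n_segments`, `k`, and `keep` are hypothetical, and the loop with its logarithmic bound on the number of segments is not reproduced; the number of segments is simply fixed by the caller.

```python
from itertools import combinations

def split_into_segments(series_set, n_segments):
    """Cut every series into n_segments equal-length pieces (any remainder is dropped)."""
    seg_len = len(series_set[0]) // n_segments
    return [[s[i * seg_len:(i + 1) * seg_len] for s in series_set]
            for i in range(n_segments)]

def dist(a, b):
    """Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def kmeans(points, k, iters=20):
    """Plain k-means with deterministic farthest-first initialisation (an assumption)."""
    centers = [list(points[0])]
    while len(centers) < k:
        centers.append(list(max(points, key=lambda p: min(dist(p, c) for c in centers))))
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: dist(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def silhouette(points, labels):
    """Mean silhouette width, standing in for the internal clustering criterion."""
    clusters = set(labels)
    if len(clusters) < 2:
        return -1.0  # a single cluster carries no structure worth selecting
    total = 0.0
    for i, p in enumerate(points):
        same = [dist(p, q) for j, q in enumerate(points)
                if labels[j] == labels[i] and j != i]
        if not same:
            continue  # a singleton cluster contributes a silhouette of 0
        a = sum(same) / len(same)
        b = min(sum(dist(p, q) for q, l in zip(points, labels) if l == other)
                / labels.count(other)
                for other in clusters if other != labels[i])
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)

def consensus(partitions, threshold=0.5):
    """Link series that co-cluster in more than `threshold` of the partitions,
    then label the connected components."""
    n = len(partitions[0])
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        share = sum(p[i] == p[j] for p in partitions) / len(partitions)
        if share > threshold:
            parent[find(j)] = find(i)
    roots = {}
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]

def segment_ensemble_clustering(series_set, n_segments, k, keep):
    """Cluster each segment, keep the `keep` best-scoring ones, and combine them."""
    scored = []
    for seg in split_into_segments(series_set, n_segments):
        labels = kmeans(seg, k)
        scored.append((silhouette(seg, labels), labels))
    best = sorted(scored, key=lambda t: t[0], reverse=True)[:keep]
    return consensus([labels for _, labels in best])

# Toy data: the first half of each series is uninformative,
# the second half separates two groups of three series.
series = [
    [0, 5, 0, 5, 0, 0, 1, 0],
    [5, 0, 5, 0, 0, 1, 0, 0],
    [0, 5, 5, 0, 1, 0, 0, 0],
    [5, 0, 0, 5, 9, 9, 8, 9],
    [0, 0, 5, 5, 9, 8, 9, 9],
    [5, 5, 0, 0, 8, 9, 9, 9],
]
final_labels = segment_ensemble_clustering(series, n_segments=2, k=2, keep=1)
print(final_labels)
```

With `keep=1` only the segment with the highest silhouette feeds the consensus step, so the uninformative first half of the toy series is ignored. Selecting with several criteria simultaneously, as the abstract mentions, would require replacing the single score with a combined ranking; how the paper does this is not stated here.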
Findings
First, the best setting of the two developed algorithms was selected according to the Rand index, an external criterion, and statistical tests. This setting was then compared with different algorithms from the literature in terms of clustering accuracy and execution time. The results indicate higher accuracy and shorter execution time for the proposed approach.
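As a reminder of how an external criterion of this kind scores a clustering against known class labels, the sketch below computes the plain (unadjusted) Rand index: the fraction of pairs of items on which the two partitions agree, either by placing both items together or by keeping them apart. Whether the paper uses the plain or the adjusted variant is not stated in the abstract, so the plain form is an assumption, and the function name `rand_index` is illustrative.

```python
from itertools import combinations

def rand_index(true_labels, pred_labels):
    """Fraction of item pairs on which two partitions agree.

    A pair counts as an agreement when both partitions put the two items
    in the same cluster, or both put them in different clusters.
    """
    pairs = list(combinations(range(len(true_labels)), 2))
    agree = sum(
        (true_labels[i] == true_labels[j]) == (pred_labels[i] == pred_labels[j])
        for i, j in pairs
    )
    return agree / len(pairs)

# Cluster names do not matter, only which items are grouped together.
same_grouping = rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Splitting one true cluster in two breaks exactly one of the six pairs.
one_split = rand_index([0, 0, 1, 1], [0, 0, 1, 2])
```

Because the index depends only on pairwise co-membership, it can compare partitions with different numbers of clusters and is unaffected by how the clusters are numbered.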
Originality/value
This paper proposes a fast and accurate approach to time series clustering in three main steps. It is the first work to use a combination of segmentation and ensemble clustering. Higher accuracy and shorter execution time are the notable achievements of this study.
Citation
Ghorbanian, A. and Razavi, H. (2023), "A new method based on ensemble time series for fast and accurate clustering", Data Technologies and Applications, Vol. 57 No. 5, pp. 756-779. https://doi.org/10.1108/DTA-08-2022-0300
Publisher
Emerald Publishing Limited
Copyright © 2023, Emerald Publishing Limited