多元时序模糊聚类分段挖掘算法

Multivariate time series fuzzy clustering segmentation mining algorithm

  • 摘要: 工业监控系统所采集到的多元时间序列在利用数据挖掘技术获取内部存在的未知模式的过程中,经常会出现原始数据庞杂、分段结果重复、交集过多和界限不清晰等问题,导致含有突变变量或数据间相关性差的数据集进行模式挖掘结果不理想.针对上述问题,本文提出了一种新的多元时序模糊聚类分段挖掘算法.实验结果表明,该算法克服了Gath-Geva算法聚类精度易受初始值影响的不足,能够较好地反映出原始数据中潜在的过程变化,从而有效地处理时间序列的分段问题并得到理想的挖掘结果.

     

    Abstract: Multivariate time series collected by industrial monitoring systems often have problems such as numerous raw data, repeated segmentation results, redundant intersections and blurry boundaries in the process of using data mining technologies to acquire internal existing unknown patterns, leading to unsatisfied mining results when the dataset involves mutation variables or inferior relevance among the data. To resolve these problems, this article introduces a new multiple time sequence clustering algorithm. Experimental results show that this algorithm can overcome the shortage that the accuracy of clustering is often affected by initial values in the Gath-Geva algorithm. It can exhibit the potential variation of raw data and thus efficiently deal with segmentation in multivariate time series to get ideal mining results.

     

/

返回文章
返回