WU Sen, WANG Qiang, JIANG Min, WEI Qing. Clustering algorithm of categorical data in consideration of sorting by weight[J]. Chinese Journal of Engineering, 2013, 35(8): 1093-1098. DOI: 10.13374/j.issn1001-053x.2013.08.016
Citation: WU Sen, WANG Qiang, JIANG Min, WEI Qing. Clustering algorithm of categorical data in consideration of sorting by weight[J]. Chinese Journal of Engineering, 2013, 35(8): 1093-1098. DOI: 10.13374/j.issn1001-053x.2013.08.016

Clustering algorithm of categorical data in consideration of sorting by weight

  • Aimed at solving the problem that part of clustering algorithms are sensitive to the data input order, a non-interference sequence index was defined, and an approach applying the non-interference sequence was proposed to sort categorical data by weight. Based on this approach, a new clustering algorithm considering sorting by weight (CABOSFV_CSW) was presented to improve CABOSFV_C, which is an efficient clustering algorithm for categorical data but sensitive to the data input order. This approach eliminates sensitivity to the data input order. UCI benchmark data sets were used to compare the proposed CABOSFV_CSW algorithm with traditional CABOSFV_C algorithm and other algorithms sensitive to the data input order. Empirical tests show that the new CABOSFV_CSW clustering algorithm for categorical data improves the accuracy and increases the stability effectively.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return