Journal of Frontiers of Computer Science and Technology ›› 2020, Vol. 14 ›› Issue (7): 1142-1153.DOI: 10.3778/j.issn.1673-9418.1907013

Previous Articles     Next Articles

Weighted Fuzzy Clustering Algorithm Based on Dynamic Interval

LUO Hao, WANG Yanjie, NIU Minghang, QIU Cunyue, ZHANG Li   

  1. College of Information, Liaoning University, Shenyang 110036, China
  • Online:2020-07-01 Published:2020-08-12



  1. 辽宁大学 信息学院,沈阳 110036


Clustering is widely used in data mining and data analysis, and a great many of troubles have been caused by incomplete data in clustering. Aiming at the inaccurate problem of filling missing attributes with estimation method in incomplete data clustering, a weighted fuzzy clustering algorithm is proposed based on dynamic interval. Firstly, the nearest neighbor sample sets of the missing attribute are constructed by the attribute correlation and then the missing attribute interval is formed. To further reduce the interval filling error, the interval factor which is based on the dispersion of the nearest neighbor sample set is used to adjust the interval size. Secondly, in order to fully exploit the implicit information of the attribute space and reduce the influence of the outliers on the cluster center, complete interval datasets are weighted based on local density of samples. Finally, the interval weighted fuzzy clustering is completed by the above improvement. The proposed clustering algorithm is verified by multiple UCI datasets and artificial datasets. The experimental results show that the weighted fuzzy clustering algorithm of dynamic interval can effectively improve the clustering accuracy, robustness and stability of convergence.

Key words: incomplete data, interval filling, weighting, clustering algorithm



关键词: 不完整数据, 区间填补, 加权, 聚类算法