加权局部方差优化初始簇中心的K-means算法

doi:10.3778/j.issn.1673-9418.1509024

计算机科学与探索 ›› 2016, Vol. 10 ›› Issue (5): 732-741.DOI: 10.3778/j.issn.1673-9418.1509024

加权局部方差优化初始簇中心的K-means算法

蔡宇浩，梁永全，樊建聪+，李璇，刘文华

山东科技大学信息科学与工程学院，山东青岛 266590

出版日期:2016-05-01 发布日期:2016-05-04

Optimizing Initial Cluster Centroids by Weighted Local Variance in K-means Algorithm

CAI Yuhao, LIANG Yongquan, FAN Jiancong+, LI Xuan, LIU Wenhua

College of Information Science and Engineering, Shandong University of Science and Technology, Qingdao, Shandong 266590, China

Online:2016-05-01 Published:2016-05-04

摘要/Abstract

摘要： 在传统K-means算法中，初始簇中心选择的随机性，导致聚类结果随不同的聚类中心而不同。因此出现了很多簇中心的选择方法，但是很多已有的簇中心选择算法，其聚类结果受参数调节的影响较大。针对这一问题，提出了一种新的初始簇中心选择算法，称为WLV-K-means（weighted local variance K-means）。该算法采用加权局部方差度量样本的密度，以更好地发现密度高的样本，并利用改进的最大最小法，启发式地选择簇初始中心点。在UCI数据集上的实验结果表明，WLV-K-means算法不仅能够取得较好的聚类结果，而且受参数变化的影响较小，有更加稳定的表现。

关键词: K-means算法, 方差, 加权, 最大最小法, 簇初始中心点

Abstract: The selection of initial cluster centroids in the classical K-means algorithm is random, which causes that the clustering results vary with different selections of cluster centroids. Thereby many selection approaches of initial centroids are devised and applied. However, most of them are affected by parameters design and parameter values. To overcome this problem, this paper proposes a novel initial cluster centroids selection algorithm, called WLV-K-means (weighted local variance K-means). The WLV-K-means algorithm employs the weighted local variance to measure the density of each sample, which can find samples with higher density. This algorithm also uses the improved max-min method to select cluster centroid heuristically. The experiments are made on UCI datasets and the results show that the WLV-K-means algorithm outperforms some improved K-means algorithms and is more stable and robust.

Key words: K-means algorithm, variance, weighting, max-min method, initial cluster centroids

蔡宇浩，梁永全，樊建聪，李璇，刘文华. 加权局部方差优化初始簇中心的K-means算法[J]. 计算机科学与探索, 2016, 10(5): 732-741.

CAI Yuhao, LIANG Yongquan, FAN Jiancong, LI Xuan, LIU Wenhua . Optimizing Initial Cluster Centroids by Weighted Local Variance in K-means Algorithm[J]. Journal of Frontiers of Computer Science and Technology, 2016, 10(5): 732-741.

[1]	范瑞东, 侯臣平. 鲁棒自加权的多视图子空间聚类[J]. 计算机科学与探索, 2021, 15(6): 1062-1073.
[2]	梁凌, 邓赵红, 王士同. 兼顾显隐信息与特征加权的多视角模糊聚类[J]. 计算机科学与探索, 2021, 15(6): 1092-1102.
[3]	张炜, 邓赵红, 王士同. 基于核诱导的不完整多视角聚类[J]. 计算机科学与探索, 2021, 15(2): 284-293.
[4]	薛红艳, 钱雪忠, 周世兵. 超簇加权的集成聚类算法[J]. 计算机科学与探索, 2021, 15(12): 2362-2373.
[5]	罗浩，王彦捷，牛明航，邱存月，张利. 动态区间的加权模糊聚类算法[J]. 计算机科学与探索, 2020, 14(7): 1142-1153.
[6]	陈兴国，徐修颖，陈康扬，杨光. 基于CMAES集成学习方法的地表水质分类[J]. 计算机科学与探索, 2020, 14(3): 426-436.
[7]	胡健，徐锴滨，毛伊敏. 基于加权网格和信息熵的并行密度聚类算法[J]. 计算机科学与探索, 2020, 14(12): 2094-2107.
[8]	王小玉，韩昌林，胡鑫豪. 加权特征融合的密集连接网络人脸识别算法[J]. 计算机科学与探索, 2019, 13(7): 1195-1205.
[9]	魏明桦，郑金贵. 自适应目标与内容匹配的层级图像分割算法[J]. 计算机科学与探索, 2019, 13(4): 681-692.
[10]	房立超，王钰，杨杏丽，李济洪. 方差正则化的分类模型选择准则[J]. 计算机科学与探索, 2019, 13(3): 457-467.
[11]	陈家益，战荫伟，曹会英，吴兴达，李小飞. 修剪中值检测的自适应加权中值滤波算法[J]. 计算机科学与探索, 2019, 13(3): 505-513.
[12]	阮传扬，韩莉娜. 考虑区间元素个数的区间犹豫模糊决策方法[J]. 计算机科学与探索, 2018, 12(9): 1513-1521.
[13]	王敏，李永明. 强赋值幺半群上的加权Mealy机与加权Moore机的关系[J]. 计算机科学与探索, 2018, 12(8): 1331-1338.
[14]	王建飞，亢良伊，刘杰，叶丹. 分布式随机方差消减梯度下降算法topkSVRG[J]. 计算机科学与探索, 2018, 12(7): 1047-1054.
[15]	杨艳，许道云. 优化加权核K-means聚类初始中心点的SLIC算法[J]. 计算机科学与探索, 2018, 12(3): 494-501.

加权局部方差优化初始簇中心的K-means算法

Optimizing Initial Cluster Centroids by Weighted Local Variance in K-means Algorithm

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics