Journal of Frontiers of Computer Science and Technology, 2017, Vol. 11, Issue (10): 1662-1671. DOI: 10.3778/j.issn.1673-9418.1609009

• Artificial Intelligence and Pattern Recognition •


Imbalanced Weighted Stochastic Gradient Descent Online Algorithm for SVM

LU Shuxia+, ZHOU Mi, JIN Zhao   

  1. Hebei Province Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding, Hebei 071002, China
  • Online: 2017-10-01    Published: 2017-10-20


Abstract: Stochastic gradient descent (SGD) has been applied to large scale support vector machine (SVM) training. Because SGD selects training points at random, for imbalanced classification problems the probability of drawing a majority-class point is far greater than that of drawing a minority-class point, which biases the computation toward the majority class. To deal with large scale imbalanced data classification problems, this paper proposes a weighted stochastic gradient descent online algorithm for SVM: samples in the majority class are assigned smaller weights while samples in the minority class are assigned larger weights, and the weighted stochastic gradient descent algorithm is then used to solve the primal problem of SVM. This reduces the shift of the separating hyperplane toward the minority class and effectively handles imbalanced data classification in large scale learning.
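
The idea in the abstract can be illustrated with a minimal sketch. Assuming a linear SVM trained on the regularized hinge-loss primal with a Pegasos-style decaying step size, the Python snippet below shows how a per-sample class weight enters each stochastic sub-gradient update so that minority-class draws carry more influence. The function name, the inverse-frequency weighting rule, and all parameter choices are illustrative assumptions, not the authors' exact algorithm.

import numpy as np

def weighted_sgd_svm(X, y, lam=0.01, epochs=5, minority_label=1, weight_ratio=None):
    """Weighted SGD sketch for a linear SVM primal (hinge loss).

    y must take values in {-1, +1}. Minority-class samples receive a larger
    per-example weight so that frequent draws of majority-class points do not
    dominate the updates. The step size eta_t = 1 / (lam * t) and the
    inverse-frequency default weight are illustrative assumptions.
    """
    n, d = X.shape
    # Weight the minority class inversely to its frequency unless a ratio is given.
    n_min = np.sum(y == minority_label)
    n_maj = n - n_min
    w_min = (n_maj / n_min) if weight_ratio is None else weight_ratio
    sample_w = np.where(y == minority_label, w_min, 1.0)

    w = np.zeros(d)
    t = 0
    rng = np.random.default_rng(0)
    for _ in range(epochs):
        for i in rng.permutation(n):
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = y[i] * (X[i] @ w)
            # Sub-gradient of (lam/2)*||w||^2 + c_i * max(0, 1 - y_i * <w, x_i>)
            if margin < 1:
                w = (1 - eta * lam) * w + eta * sample_w[i] * y[i] * X[i]
            else:
                w = (1 - eta * lam) * w
    return w

The larger weight on minority-class samples enlarges their sub-gradient contribution whenever they are drawn, which is what counteracts the sampling imbalance described above.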

Key words: stochastic gradient descent (SGD), weight, imbalanced data, large scale learning, support vector machine (SVM)