Imbalanced Weighted Stochastic Gradient Descent Online Algorithm for SVM

doi:10.3778/j.issn.1673-9418.1609009

Abstract

Abstract: Stochastic gradient descent (SGD) has been applied to large scale support vector machine (SVM) training. Stochastic gradient descent takes a random way to select points during training process, this leads to a result that the probability of choosing majority class is far greater than that of choosing minority class for imbalanced classification problem. In order to deal with large scale imbalanced data classification problems, this paper proposes a method named weighted stochastic gradient descent algorithm for SVM. After the samples in the majority class are assigned a smaller weight while the samples in the minority class are assigned a larger weight, the weighted stochastic gradient descent algorithm will be used to solving the primal problem of SVM, which helps to reduce the hyperplane offset to the minority class, thus solves the large scale imbalanced data classification problems.

Key words: stochastic gradient descent (SGD), weight, imbalanced data, large scale learning, support vector machine (SVM)

摘要： 随机梯度下降（stochastic gradient descent，SGD）方法已被应用于大规模支持向量机（support vector machine，SVM）训练，其在训练时采取随机选点的方式，对于非均衡分类问题，导致多数类点被抽取到的概率要远远大于少数类点，造成了计算上的不平衡。为了处理大规模非均衡数据分类问题，提出了加权随机梯度下降的SVM在线算法，对于多数类中的样例被赋予较小的权值，而少数类中的样例被赋予较大的权值，然后利用加权随机梯度下降算法对SVM原问题进行求解，减少了超平面向少数类的偏移，较好地解决了大规模学习中非均衡数据的分类问题。

关键词: 随机梯度下降（SGD）, 权, 非均衡数据, 大规模学习, 支持向量机（SVM）

LU Shuxia, ZHOU Mi, JIN Zhao. Imbalanced Weighted Stochastic Gradient Descent Online Algorithm for SVM[J]. Journal of Frontiers of Computer Science and Technology, 2017, 11(10): 1662-1671.

鲁淑霞，周谧，金钊. 非均衡加权随机梯度下降SVM在线算法[J]. 计算机科学与探索, 2017, 11(10): 1662-1671.

[1]	WANG Dicong, BAI Chenshuai, WU Kaijun. Survey of Video Object Detection Based on Deep Learning [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1563-1577.
[2]	REN Longjie, SUN Ying, DING Weiping, JU Hengrong, CAO Jinxin. Multiple Lesions Detection of Fundus Images Based on CNN Algorithm Optimized by Single Population Frog-Leaping Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1762-1772.
[3]	LI Chengyan, SONG Yue, MA Jintao. RIOPSO Algorithm for Fuzzy Cloud Resource Scheduling Problem [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(8): 1534-1545.
[4]	FAN Ruidong, HOU Chenping. Robust Auto-weighted Multi-view Subspace Clustering [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(6): 1062-1073.
[5]	LIANG Ling, DENG Zhaohong, WANG Shitong. Multi-view Fuzzy Clustering Combining Visual and Hidden Information with Feature Weighting [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(6): 1092-1102.
[6]	CHEN Cheng, HE Xingshi, YANG Xinshe. Double Cuckoo Search Algorithm with Dynamically Adjusted Probability [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(5): 859-880.
[7]	SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun. Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 733-742.
[8]	XIAO Zhenjiu, YANG Xiaodi, WEI Xian, TANG Xiaoliang. Improved Lightweight Network in Image Recognition [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 743-753.
[9]	ZHANG Wei, DENG Zhaohong, WANG Shitong. Kernel-Induced Incomplete Multi-view Clustering [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(2): 284-293.
[10]	XUE Hongyan, QIAN Xuezhong, ZHOU Shibing. Ensemble Clustering Algorithm Based on Weighted Super Cluster [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(12): 2362-2373.
[11]	PAN Jiawen, QIAN Qian, FU Yunfa, FENG Yong. Multi-population Genetic Algorithm Based on Optimal Weight Dynamic Control Learning Mechanism [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(12): 2421-2437.
[12]	MENG Xianfa, LIU Fang, LI Guang, HUANG Mengmeng. Review of Knowledge Distillation in Convolutional Neural Network Compression [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(10): 1812-1829.
[13]	LANG Lei, XIA Yingqing. Survey on Compact Neural Network Model Design [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(9): 1456-1470.
[14]	LUO Hao, WANG Yanjie, NIU Minghang, QIU Cunyue, ZHANG Li. Weighted Fuzzy Clustering Algorithm Based on Dynamic Interval [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(7): 1142-1153.
[15]	LIN Hao, LI Leixiao, WANG Hui. Survey on Research and Application of Support Vector Machines in Intelligent Transportation System [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(6): 901-917.

Imbalanced Weighted Stochastic Gradient Descent Online Algorithm for SVM

非均衡加权随机梯度下降SVM在线算法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics