Self-Adaptive SVM Incremental Learning Algorithm

doi:10.3778/j.issn.1673-9418.1805036

Journal of Frontiers of Computer Science and Technology ›› 2019, Vol. 13 ›› Issue (4): 647-656. DOI: 10.3778/j.issn.1673-9418.1805036

Self-Adaptive SVM Incremental Learning Algorithm

HE Li, HAN Keping+, LIU Ying

College of Science and Technology, Tianjin University of Finance & Economics, Tianjin 300222, China

Online: 2019-04-01 Published: 2019-04-10

Abstract

The support vector machine (SVM) algorithm is widely used for classification because it is robust and performs well on small training sets. However, traditional SVM classifiers cannot meet the demands of incremental or large-scale data, and incremental learning is one effective way to address this. Based on a structured description of the data distribution, this paper proposes a self-adaptive SVM incremental learning algorithm. Using the geometric distances from the original samples and the newly added samples to the current separating hyperplane, the algorithm builds a self-adaptive incremental sample selection model that selects the boundary samples to take part in incremental training. To balance the speed and performance of incremental learning, the model sets separate adjustment coefficients, based on spatial distribution similarity, for the new samples and for the original model samples. Experimental results show that the proposed algorithm achieves higher training speed and better classification performance.

Key words: support vector machine (SVM), incremental learning, data distribution, hyperplane distance
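
As a rough illustration of the distance-based selection idea described in the abstract, the Python sketch below keeps only the samples that lie close to a given hyperplane. Everything specific here is an assumption made for illustration: the linear hyperplane `w`, `b`, the toy points, the helper name `select_boundary_samples`, and the fixed `coef` values. The abstract does not give the paper's actual selection model, nor how its adjustment coefficients are derived from spatial distribution similarity.

```python
import math


def geometric_distance(w, b, x):
    """Geometric distance from point x to the hyperplane w.x + b = 0."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return abs(sum(wi * xi for wi, xi in zip(w, x)) + b) / norm


def select_boundary_samples(samples, w, b, coef):
    """Keep samples whose distance to the hyperplane is at most coef * margin.

    `coef` is a hypothetical stand-in for the paper's adjustment coefficient;
    the abstract does not say how that coefficient is computed.
    """
    margin = 1.0 / math.sqrt(sum(wi * wi for wi in w))  # half-width of the SVM margin
    return [x for x in samples if geometric_distance(w, b, x) <= coef * margin]


# Toy linear model: hyperplane x1 + x2 - 1 = 0 (assumed, not taken from the paper).
w, b = [1.0, 1.0], -1.0
original = [[0.2, 0.2], [3.0, 3.0], [0.5, 0.8]]
new = [[1.0, 1.0], [-2.0, -2.0], [0.4, 0.4]]

# Separate coefficients for the original and the new samples, mirroring the
# abstract's description; the values 1.0 and 2.0 are arbitrary.
kept_original = select_boundary_samples(original, w, b, coef=1.0)
kept_new = select_boundary_samples(new, w, b, coef=2.0)

# Only the retained samples would be passed on to the next round of SVM training.
incremental_training_set = kept_original + kept_new
print(incremental_training_set)
```

In this sketch a larger `coef` retains more samples, which trades training speed for a fuller picture of the data near the boundary; the paper's self-adaptive model presumably sets this trade-off automatically.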

HE Li, HAN Keping, LIU Ying. Self-Adaptive SVM Incremental Learning Algorithm[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(4): 647-656.
