Journal of Frontiers of Computer Science and Technology (计算机科学与探索), 2018, Vol. 12, Issue 7: 1047-1054. DOI: 10.3778/j.issn.1673-9418.1705044

• Academic Research •


Distributed Stochastic Variance Reduction Gradient Descent Algorithm topkSVRG

WANG Jianfei, KANG Liangyi, LIU Jie, YE Dan   

  1. Technology Center of Software Engineering, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China
    2. University of Chinese Academy of Sciences, Beijing 100190, China
  • Online:2018-07-01 Published:2018-07-06

Abstract:

Machine learning problems are usually cast as the minimization of an objective function, and optimization algorithms are the key tool for solving for the parameters of that objective. Stochastic gradient descent (SGD) is currently the most widely used optimizer, but its noisy gradient estimates limit it to a sub-linear convergence rate, whereas the improved stochastic variance reduction gradient (SVRG) method achieves a linear rate. SVRG, however, is a serial single-machine algorithm. To handle distributed training on large-scale data sets, this paper designs topkSVRG, a distributed algorithm built on the SVRG idea. The key change is that a master node maintains a global model while worker nodes update local models on their own data shards; in each round, the k local models closest to the current global model are averaged to update the global model. Increasing k speeds up convergence, while decreasing k safeguards convergence. The paper gives a theoretical analysis of the algorithm's linear convergence and implements it on Spark; experiments against Mini-Batch SGD, CoCoA, Splash and related algorithms show that topkSVRG converges faster when high accuracy is required.

Key words: machine learning, optimization, stochastic gradient descent (SGD), stochastic variance reduction gradient (SVRG), distributed computing
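
To make the aggregation rule concrete, below is a minimal single-process sketch of the scheme the abstract describes: each simulated worker runs one local SVRG epoch starting from the current global model, and the master averages the k local models closest to that global model. This is only an illustration under assumed settings (a least-squares objective, NumPy, and made-up hyper-parameters such as step and inner_iters), not the authors' Spark implementation.

# Minimal single-process sketch of the topkSVRG idea described in the abstract,
# not the authors' Spark implementation. The least-squares objective, function
# names and all hyper-parameters below are illustrative assumptions.
import numpy as np


def svrg_epoch(w_global, X, y, step=0.02, inner_iters=400, rng=None):
    """One local SVRG epoch on a worker shard (X, y) for least squares.

    Uses the standard variance-reduced gradient
        g = grad_i(w) - grad_i(w_snapshot) + full_grad(w_snapshot),
    with the current global model serving as the snapshot.
    """
    if rng is None:
        rng = np.random.default_rng()
    n = X.shape[0]
    full_grad = X.T @ (X @ w_global - y) / n   # full gradient at the snapshot
    w = w_global.copy()
    for _ in range(inner_iters):
        i = rng.integers(n)
        xi, yi = X[i], y[i]
        g = xi * (xi @ w - yi) - xi * (xi @ w_global - yi) + full_grad
        w -= step * g
    return w


def topk_average(w_global, local_models, k):
    """Master step: average the k local models closest to the global model."""
    dists = [np.linalg.norm(w - w_global) for w in local_models]
    closest = np.argsort(dists)[:k]
    return np.mean([local_models[i] for i in closest], axis=0)


# Toy driver: 4 simulated workers holding shards of a linear-regression problem.
rng = np.random.default_rng(0)
d = 5
w_true = rng.normal(size=d)
shards = []
for _ in range(4):
    X = rng.normal(size=(200, d))
    shards.append((X, X @ w_true + 0.01 * rng.normal(size=200)))

w_global = np.zeros(d)
for epoch in range(5):
    local_models = [svrg_epoch(w_global, X, y, rng=rng) for X, y in shards]
    w_global = topk_average(w_global, local_models, k=3)
    print(epoch, np.linalg.norm(w_global - w_true))

In this toy driver, a larger k averages more local models per round and speeds up progress, while a smaller k keeps only the local models that stayed near the global model, which is the conservative choice the abstract associates with guaranteed convergence.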