动态学习机制的双种群蚁群算法

doi:10.3778/j.issn.1673-9418.1806012

计算机科学与探索 ›› 2019, Vol. 13 ›› Issue (7): 1239-1250.DOI: 10.3778/j.issn.1673-9418.1806012

动态学习机制的双种群蚁群算法

袁汪凰1+，游晓明1，刘升2

1.上海工程技术大学电子电气工程学院，上海 201600
2.上海工程技术大学管理学院，上海 201600

出版日期:2019-07-01 发布日期:2019-07-08

Dual-Population Ant Colony Algorithm on Dynamic Learning Mechanism

YUAN Wanghuang1+, YOU Xiaoming1, LIU Sheng2

1.School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201600, China
2.School of Management, Shanghai University of Engineering Science, Shanghai 201600, China

Online:2019-07-01 Published:2019-07-08

摘要/Abstract

摘要： 针对蚁群算法易陷入局部最优与收敛速度较慢的不足，提出了动态学习机制的双种群蚁群算法。该算法重点引入奖惩模型，奖励算子提高算法的收敛速度，惩罚算子增加种群的多样性。由SA-MMAS（adaptive simulated annealing ant colony algorithm based on max-min ant system）和MMAS（max-min ant system）两个种群合作搜索路径，蚁群间根据不同城市规模动态地进行信息素交流，在种群交流后利用奖惩模型对双种群间的学习合作行为给予动态的反馈，从而平衡算法的多样性与收敛速度。通过17个经典旅行商问题（traveling salesman problem，TSP）实例进行验证，结果表明该算法能以较少的迭代次数取得最优解或接近最优解。对于中大规模的TSP问题效果更好，从而验证了算法的高效性和可行性。

关键词: 动态学习, 奖惩模型, 双种群, 旅行商问题

Abstract: Aiming at the deficiencies of ant colony algorithm that can easily fall into the local optimum and the convergence speed is slow, a dual population ant colony algorithm based on dynamic learning mechanism is proposed. This algorithm focuses on the reward penalty model. The reward operator improves the convergence speed of the algorithm, and the penalty operator improves the diversity of the algorithm. Two populations SA-MMAS (adaptive simulated annealing ant colony algorithm based on max-min ant system) and MMAS (max-min ant system) coopera-tively search paths, and then the ant colonies dynamically communicate pheromone according to different city sizes. After communication between the two colonies, the incentive penalty model is used to give dynamic feedback to the learning cooperative behavior between the two colonies, thus balancing the diversity and convergence speed of the algorithm. Verified by 17 classic TSP (traveling salesman problem) instances, the results show that the algorithm can obtain the optimal solution or near optimal solution with fewer iterations. It is more effective for medium and large-scale TSP, thus verifying the efficiency and feasibility of the algorithm.

Key words: dynamic learning, reward penalty model, dual population, traveling salesman problem (TSP)

袁汪凰，游晓明，刘升. 动态学习机制的双种群蚁群算法[J]. 计算机科学与探索, 2019, 13(7): 1239-1250.

YUAN Wanghuang, YOU Xiaoming, LIU Sheng. Dual-Population Ant Colony Algorithm on Dynamic Learning Mechanism[J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(7): 1239-1250.

[1]	莫亚东，游晓明，刘升. 融合奖惩学习策略的动态分级蚁群算法[J]. 计算机科学与探索, 2021, 15(9): 1703-1716.
[2]	陈斌，刘卫国. 基于SAC模型的改进遗传算法求解TSP问题[J]. 计算机科学与探索, 2021, 15(9): 1680-1693.
[3]	刘一凡，游晓明，刘升. 基于动态重组和协同交流策略的蚁群优化算法[J]. 计算机科学与探索, 2021, 15(8): 1511-1525.
[4]	纪伟，李英梅，季伟东，张珑. 粒子置换的双种群综合学习PSO算法[J]. 计算机科学与探索, 2021, 15(4): 766-776.
[5]	孟静雯，游晓明，刘升. 结合协同机制与动态调控策略的双蚁群算法[J]. 计算机科学与探索, 2021, 15(11): 2206-2221.
[6]	潘晗，游晓明，刘升. 考虑动态导向与邻域交互的双蚁型算法[J]. 计算机科学与探索, 2020, 14(6): 1005-1016.
[7]	张德惠，游晓明，刘升. 融合猫群算法的动态分组蚁群算法[J]. 计算机科学与探索, 2020, 14(5): 880-891.
[8]	刘中强，游晓明，刘升. 启发式强化学习机制的异构双种群蚁群算法[J]. 计算机科学与探索, 2020, 14(3): 460-469.
[9]	冯志雨，游晓明，刘升. 分层递进的改进聚类蚁群算法解决TSP问题[J]. 计算机科学与探索, 2019, 13(8): 1280-1294.
[10]	朱宏伟，游晓明，刘升. 协同过滤策略的异构双种群蚁群算法[J]. 计算机科学与探索, 2019, 13(10): 1754-1767.
[11]	侯乐，杨辉华，樊永显，李灵巧，蒋淑洁. 基于ILS-CS优化算法的个性化旅游线路研究[J]. 计算机科学与探索, 2016, 10(1): 142-150.
[12]	彭岳，王俊，谢斌福，张月峰，王崇骏. 改进的重叠蚁群优化算法[J]. 计算机科学与探索, 2014, 8(8): 1002-1008.

动态学习机制的双种群蚁群算法

Dual-Population Ant Colony Algorithm on Dynamic Learning Mechanism

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 12

编辑推荐

Metrics