云环境下分层的中间数据容错方法

doi:10.3778/j.issn.1673-9418.1405049

计算机科学与探索 ›› 2015, Vol. 9 ›› Issue (5): 546-554.DOI: 10.3778/j.issn.1673-9418.1405049

云环境下分层的中间数据容错方法

宋宝燕，李雪城，任才，丁琳琳+

辽宁大学信息学院，沈阳 110036

出版日期:2015-05-01 发布日期:2015-05-06

Layered Intermediate Data Fault-Tolerance Approach in Cloud

SONG Baoyan, LI Xuecheng, REN Cai, DING Linlin+

School of Information, Liaoning University, Shenyang 110036, China

Online:2015-05-01 Published:2015-05-06

摘要/Abstract

摘要： 通常在云计算框架的处理过程中会产生大量的、短暂的，同时又非常重要的中间数据。一旦有服务器失效，将会导致中间数据失效，进而影响整个任务的计算。现有的数据容错处理方法仅仅采用简单的复制策略，没有考虑中间数据的特点，会带来庞大的网络开销。因此，提出了一种有效的分层中间数据容错方法，即IDF_Support（intermediate data fault-tolerance_support）方法。通过将计算任务划分为不同类别，IDF_Support方法能够有效地处理中间数据失效。提出了分层的中间数据容错算法，分别是用于解决一个任务内部容错的中间数据容错算法（Inner_Task IDF）和用于解决任务间容错的中间数据容错算法（Outer_Task IDF）。实验结果表明，这些算法在机器出现故障的情况下提高了作业响应时间，保证了系统的可靠性。

关键词: 云计算, 中间数据, 副本, 容错算法

Abstract: Cloud computing frameworks usually generate large amounts of intermediate data which are short-lived, yet are important for the completion of job. Once there are server failures, it will lead to the failures of intermediate data, and affects the computation of the whole job. However, the existing fault-tolerant processing approaches only adopt simple replication strategies which can incur significant network overhead, and have no considering of the characteristics of intermediate data. Therefore, this paper proposes an efficient layered intermediate data fault-tolerant approach, named IDF_Support (intermediate data fault-tolerance_support) approach. By dividing the computing tasks into different classifications, IDF_Support approach can effectively process the intermediate data failures. Then, this paper proposes two layered intermediate data fault-tolerant algorithms, respectively the inner task intermediate data fault-tolerant algorithm (Inner_Task IDF) which resolves the fault-tolerance within a task and the outer task intermediate data fault-tolerant algorithm (Outer_Task IDF) which resolves the fault-tolerance among tasks. The experimental results show that the proposed algorithms can improve the response time in the case of machine failure, and keep the reliability of the whole system.

Key words: cloud computing, intermediate data, replication, fault-tolerant algorithm

宋宝燕，李雪城，任才，丁琳琳. 云环境下分层的中间数据容错方法[J]. 计算机科学与探索, 2015, 9(5): 546-554.

SONG Baoyan, LI Xuecheng, REN Cai, DING Linlin. Layered Intermediate Data Fault-Tolerance Approach in Cloud[J]. Journal of Frontiers of Computer Science and Technology, 2015, 9(5): 546-554.

[1]	邵必林，贺金能，边根庆. 基于多目标分解策略的副本布局算法研究[J]. 计算机科学与探索, 2020, 14(9): 1490-1500.
[2]	吴虹佳，刘芳，刘斌，蔡志平. 分散计算：技术、应用与挑战[J]. 计算机科学与探索, 2020, 14(5): 721-730.
[3]	郑良汉，何亨，童潜，杨湘，陈享. 云环境中的多授权机构访问控制方案[J]. 计算机科学与探索, 2020, 14(11): 1865-1878.
[4]	张胜霞，田呈亮. 在幺模矩阵加密方法下的安全外包算法[J]. 计算机科学与探索, 2020, 14(1): 73-82.
[5]	陈彦橦，裴树军，苗辉. 云科学工作流截止期限约束代价优化调度算法[J]. 计算机科学与探索, 2019, 13(8): 1307-1318.
[6]	任晓莉，杨建卫，李乃乾. 云计算中基于动态虚拟化电子流密码的安全存储[J]. 计算机科学与探索, 2019, 13(8): 1331-1340.
[7]	赵倩，谢上钦，韩轲，龚青泽，冯光升，林俊宇. 远程直接内存访问与检查点相结合的容器迁移[J]. 计算机科学与探索, 2019, 13(12): 1995-2007.
[8]	谢纪东，武继刚. 间隔执行的异步副本放置策略[J]. 计算机科学与探索, 2018, 12(8): 1339-1349.
[9]	李勇，滕飞，黄齐川，李天瑞. 基于Spark的时间序列并行分解模型[J]. 计算机科学与探索, 2018, 12(7): 1055-1063.
[10]	贾大宇，信俊昌，王之琼，郭薇，王国仁. 区块链的存储容量可扩展模型[J]. 计算机科学与探索, 2018, 12(4): 525-535.
[11]	裴树军，宋冬梅，孔德凯. Map/Reduce下快速剪枝算法在复杂任务调度中的应用[J]. 计算机科学与探索, 2018, 12(1): 72-81.
[12]	刘沛东，安博，钟业弘，王虎，曹东刚. 私有云环境下基于虚拟集群的资源共享方法[J]. 计算机科学与探索, 2017, 11(8): 1204-1213.
[13]	崔波，李茹，刘靖，张玉军，李忠诚. 有限空间下的自治移动云存储协议[J]. 计算机科学与探索, 2017, 11(2): 271-285.
[14]	杨松霖，张广艳. 纠删码存储系统中数据修复方法综述[J]. 计算机科学与探索, 2017, 11(10): 1531-1544.
[15]	李帅，党鑫，王旭，武继刚. 副本放置中的更新策略及算法[J]. 计算机科学与探索, 2016, 10(11): 1633-1640.

云环境下分层的中间数据容错方法

Layered Intermediate Data Fault-Tolerance Approach in Cloud

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics