计算机科学与探索 ›› 2013, Vol. 7 ›› Issue (7): 577-591.DOI: 10.3778/j.issn.1673-9418.1304023

• 综述·探索 • 上一篇    下一篇

服务软件系统的健康管理综述

陈  光+,白晓颖,刘永利,周立柱   

  1. 清华大学 计算机科学与技术系,北京 100084
  • 出版日期:2013-07-01 发布日期:2013-07-02

Survey on Health Management of Service-Based Software System

CHEN Guang+, BAI Xiaoying, LIU Yongli, ZHOU Lizhu   

  1. Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
  • Online:2013-07-01 Published:2013-07-02

摘要: 服务软件系统具有开放性和动态性的特点,给软件质量控制提出了新的挑战。软件健康管理由应用于硬件系统的预测与健康管理思想延伸而来,由在线监测、健康诊断、管理与决策、故障恢复四个方面构成。它通过调节或者重构等方法增强了系统故障诊断、隔离和修复的能力,提高了系统的可用性和可靠性。介绍了软件健康管理的发展背景和主要研究方法,分析了服务软件松散耦合、开放、动态等特点给软件健康管理研究带来的新的需求,并从运行监测、故障诊断和动态重构三个方面探讨了服务软件的软件健康管理热点问题和研究现状。

关键词: 软件健康管理, 软件即服务, 面向服务的体系结构

Abstract: The open and dynamic nature of service-based software system imposes new challenges to software quality control. Emerging from hardware prognostics and health management (PHM), software health management (SHM) is a promising technique to improve system availability and reliability. SHM is composed of four parts including runtime monitoring, health diagnosis, management and decision, and rejuvenation. By software adjustment or reconfiguration, it enforces the abilities of fault diagnosis, isolation and recovery of runtime software, and improves the availability and reliability. This paper introduces SHM concepts, major methods and research achievements in traditional software architecture. Particularly, it investigates new needs of SHM due to loose coupling, openness and dynamics in service-based systems, and discusses key enabling techniques including runtime monitoring, fault diagnosis and service reconfiguration.

Key words: software health management, software as a service, service-oriented architecture