计算机科学与探索 ›› 2014, Vol. 8 ›› Issue (5): 593-600.DOI: 10.3778/j.issn.1673-9418.1307018

• 人工智能与模式识别 • 上一篇    下一篇

CCA三支决策模型的边界域样本处理

张燕平1,2,邹慧锦1,2,邢  航1,2,赵  姝1,2+   

  1. 1. 安徽大学 计算机科学与技术学院,合肥 230601
    2. 安徽大学 计算智能与信号处理教育部重点实验室,合肥 230601
  • 出版日期:2014-05-01 发布日期:2014-05-05

Dealing with Samples in Boundary Regions of Three-Way Decisions Model Based on CCA

ZHANG Yanping1,2, ZOU Huijin1,2, XING Hang1,2, ZHAO Shu1,2+   

  1. 1. School of Computer Science and Technology, Anhui University, Hefei 230601, China
    2. Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Anhui University, Hefei 230601, China
  • Online:2014-05-01 Published:2014-05-05

摘要: 构造性覆盖算法(constructive covering algorithm,CCA)三支决策模型在学习过程中根据样本分布特征,自动产生正域、负域和边界域。该模型为边界域样本处理问题提供了新的思路。重点讨论了基于CCA的三支决策的边界域样本处理问题。对边界域样本处理提出了两种决策方案:一种为处理全部的边界域样本,给出了距中心最近原则、距边界最近原则、万有引力原则3种方法;另一种为处理部分的边界域样本,即只对满足一定条件的边界域样本作进一步的划分,这样使不满足条件的边界域样本仍保留在边界域,提高了边界域样本处理的正确率。用十字交叉法在5组数据集上对这两种决策方案进行了对比,实验结果表明,处理部分边界域样本时正确率更高,效果更好。

关键词: 构造性覆盖算法(CCA), 边界域样本处理, 三支决策

Abstract: The three-way decisions model based on constructive covering algorithm (CCA) produces automatically three regions: positive regions (POS), negative regions (NEG) and boundary regions (BND) according to the distribution characteristic of samples. The model provides a new idea to deal with samples in BND for the theory of three-way decisions. This paper proposes two methods to deal with samples in BND: The first method deals with all samples in BND, which gives three principles, nearest to the center principle, nearest to the boundary principle and gravity principle; The second one deals with part of samples in BND, i.e., the samples which satisfy some conditions are dealt with, and the rest samples are still in the BND. The second method promotes the accuracy. This paper compares the two methods on five datasets using 10-fold cross-validation. The experimental results show that the effect and accuracy of dealing with part of samples in BND are better than those of dealing with all samples in BND.

Key words: constructive covering algorithm (CCA), dealing with samples in boundary regions, three-way decisions