计算机科学与探索 ›› 2021, Vol. 15 ›› Issue (6): 1145-1154.DOI: 10.3778/j.issn.1673-9418.2006063

• 理论与算法 • 上一篇    下一篇

面向滑动窗口法的概念格漂移计算研究

徐霁琳,徐健锋,刘龙,吴方文   

  1. 1. 南昌大学 信息工程学院,南昌 330031
    2. 南昌大学 软件学院,南昌 330047
    3. 同济大学 电子与信息工程学院,上海 201804
  • 出版日期:2021-06-01 发布日期:2021-06-03

Research on Drift Calculation of Concept Lattice for Sliding Window Method

XU Jilin, XU Jianfeng, LIU Long, WU Fangwen   

  1. 1. College of Information Engineering, Nanchang University, Nanchang 330031, China
    2. College of Software, Nanchang University, Nanchang 330047, China
    3. College of Electronics and Information Engineering, Tongji University, Shanghai 201804, China
  • Online:2021-06-01 Published:2021-06-03

摘要:

概念格是一种数据分析和规则获取的有效工具,近年来概念格的应用和研究已逐渐成为数据分析领域的一个重要研究方向。当前随着信息技术的发展,流数据成为了大数据的重要组成部分,流数据知识挖掘中普遍存在的概念漂移已经成为近年来机器学习领域的热点问题。动态概念格的构造是概念格理论研究的重要研究任务,但是在流数据环境中进行概念格的概念漂移研究在学术界还没有展开。针对流数据环境中概念格的漂移问题,提出了一种面向滑动窗口法的概念格漂移计算方法。首先对滑动窗口中的流数据进行建模;然后对滑动窗口中的流入流出概念相同、流入流出概念不同、流入流出概念部分相交、流入概念包含流出概念和流出概念包含流入概念这五种现象分别进行推理研究;最后基于上述模型理论推理,提出面向滑动窗口法的概念格构造算法,并用实例说明了该算法的有效性和高效性。

关键词: 概念格, 流数据, 概念漂移, 滑动窗口, 动态概念

Abstract:

Concept lattice is an effective tool for data analysis and rule acquisition. In recent years, the application and research of concept lattice has gradually become an important research direction in the field of data analysis. With the development of information technology, stream data have become an important part of big data, and the concept drift in stream data mining has become a hot topic in machine learning. The construction of dynamic concept lattice is an important research task of concept lattice theory, but the research of concept lattice drift in streaming data environment has not been carried out. To solve the problem of concept lattice drift in stream data environment, the drift calculation method of concept lattice based on sliding window method is proposed in this paper. First, the stream data in the sliding window are modeled. Then, in the sliding window, this paper conducts inference research separately for five phenomena, i.e. the same inflow and outflow concepts, different inflow and outflow concepts, partial intersection of inflow and outflow concepts, inflow concept including outflow concept, and the outflow concept including the inflow concept. Finally, based on the above model reasoning, a concept lattice construction algorithm based on sliding window method is proposed, and an example is given to illustrate the effectiveness and efficiency of the algorithm.

Key words: concept lattice, stream data, concept drift, sliding window, dynamic concept