Research on Multi-Label Classification Method of Traditional Chinese Medicine Clinical Disease Data

doi:10.3778/j.issn.1673-9418.1705035

Journal of Frontiers of Computer Science and Technology ›› 2018, Vol. 12 ›› Issue (8): 1295-1304.DOI: 10.3778/j.issn.1673-9418.1705035

Previous Articles Next Articles

Research on Multi-Label Classification Method of Traditional Chinese Medicine Clinical Disease Data

PAN Zhuqiang1, ZHANG Lin1, ZHANG Lei2+, LI Guozheng3, YAN Shixing4

1. School of Computer Science, Southwest Petroleum University, Chengdu 610500, China
2. Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
3. National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China
4. Shanghai Menorah Information Technology Co., Ltd., Shanghai 201800, China

Online:2018-08-01 Published:2018-08-09

中医临床疾病数据多标记分类方法研究

潘主强1，张林1，张磊2+，李国正3，颜仕星4

1. 西南石油大学计算机科学学院，成都 610500
2. 中国中医科学院中医临床基础医学研究所，北京 100700

3. 中国中医科学院中医药数据中心，北京 100700

4. 上海金灯台信息科技有限公司，上海 201800

Abstract

Abstract: WML-kNN (weighted multi-label [k] nearest neighbor) learning algorithm, the number of neighbor points from fixed value, without considering the actual characteristics of the sample data, may make the high similarity point excluded from the neighbor set, or the low similarity point contained in the neighbor set, which will affect the performance of classifier. Traditional Chinese medicine (TCM) clinical data on the disease are likely to have multiple labels, and because of the particularity of the sample, each sample may have different similarity neighbors. This paper improves the WML-kNN algorithm and proposes WML-GkNN (WML-granular kNN) algorithm. In WML-GkNN algorithm, the granular control is used to control the granularity space, and the set of neighbors is determined, so that the sample points in the neighborhood have high similarity. The experimental results on the meridian resistance data collected by TCM show that the WML-GkNN algorithm improves the classification performance.

Key words: Chinese medicine clinical data, multi-label learning, granular computing, weight

摘要： WML-kNN（weighted multi-label[k]nearest neighbor）算法中近邻点个数取固定值，而没有考虑样本数据的实际特点，可能会将相似度高的点排除在近邻集外，或者将相似度低的点包含在近邻集内，这些都会影响分类器的性能。而中医（traditional Chinese medicine，TCM）临床获得的关于疾病的数据很可能是多标记的，同时由于病例的特殊性，每个病例可能具有不同的相似近邻集。因此，对WML-kNN算法进行了改进，提出WML-GkNN（WML-granular kNN）算法。该算法通过粒计算对粒度空间进行控制，从而确定近邻点集，使得邻域内的样本点有高相似性。在中医临床采集的经络电阻数据上的实验结果显示，WML-GkNN算法提高了分类性能。

关键词: 中医临床数据, 多标记学习, 粒计算, 权重

PAN Zhuqiang, ZHANG Lin, ZHANG Lei, LI Guozheng, YAN Shixing. Research on Multi-Label Classification Method of Traditional Chinese Medicine Clinical Disease Data[J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(8): 1295-1304.

潘主强，张林，张磊，李国正，颜仕星. 中医临床疾病数据多标记分类方法研究[J]. 计算机科学与探索, 2018, 12(8): 1295-1304.

[1]	WANG Dicong, BAI Chenshuai, WU Kaijun. Survey of Video Object Detection Based on Deep Learning [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1563-1577.
[2]	REN Longjie, SUN Ying, DING Weiping, JU Hengrong, CAO Jinxin. Multiple Lesions Detection of Fundus Images Based on CNN Algorithm Optimized by Single Population Frog-Leaping Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(9): 1762-1772.
[3]	LI Chengyan, SONG Yue, MA Jintao. RIOPSO Algorithm for Fuzzy Cloud Resource Scheduling Problem [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(8): 1534-1545.
[4]	FAN Ruidong, HOU Chenping. Robust Auto-weighted Multi-view Subspace Clustering [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(6): 1062-1073.
[5]	LIANG Ling, DENG Zhaohong, WANG Shitong. Multi-view Fuzzy Clustering Combining Visual and Hidden Information with Feature Weighting [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(6): 1092-1102.
[6]	CHEN Cheng, HE Xingshi, YANG Xinshe. Double Cuckoo Search Algorithm with Dynamically Adjusted Probability [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(5): 859-880.
[7]	SU Jiangyi, SONG Xiaoning, WU Xiaojun, YU Dongjun. Skeleton Based Action Recognition Algorithm on Multi-modal Lightweight Graph Convolutional Network [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 733-742.
[8]	XIAO Zhenjiu, YANG Xiaodi, WEI Xian, TANG Xiaoliang. Improved Lightweight Network in Image Recognition [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(4): 743-753.
[9]	ZHANG Wei, DENG Zhaohong, WANG Shitong. Kernel-Induced Incomplete Multi-view Clustering [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(2): 284-293.
[10]	XUE Hongyan, QIAN Xuezhong, ZHOU Shibing. Ensemble Clustering Algorithm Based on Weighted Super Cluster [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(12): 2362-2373.
[11]	PAN Jiawen, QIAN Qian, FU Yunfa, FENG Yong. Multi-population Genetic Algorithm Based on Optimal Weight Dynamic Control Learning Mechanism [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(12): 2421-2437.
[12]	MENG Xianfa, LIU Fang, LI Guang, HUANG Mengmeng. Review of Knowledge Distillation in Convolutional Neural Network Compression [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(10): 1812-1829.
[13]	LANG Lei, XIA Yingqing. Survey on Compact Neural Network Model Design [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(9): 1456-1470.
[14]	LUO Hao, WANG Yanjie, NIU Minghang, QIU Cunyue, ZHANG Li. Weighted Fuzzy Clustering Algorithm Based on Dynamic Interval [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(7): 1142-1153.
[15]	ZHANG Qiwen, WEI Yachen. Particle Swarm Optimization with Independent Adaptive Parameter Adjustment [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(4): 637-648.

Research on Multi-Label Classification Method of Traditional Chinese Medicine Clinical Disease Data

中医临床疾病数据多标记分类方法研究

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles 0

Metrics