Multi-Label Feature Extraction Method Relied on Feature-Label Dependence Auto-encoder

doi:10.3778/j.issn.1673-9418.1903053

Abstract

Abstract:

In multi-label learning, how to deal with high-dimensional features has always been one of the research difficulties. The feature extraction algorithm can effectively solve the problem of classification performance degra-dation caused by high dimensionality of data features. However, the existing multi-label feature extraction algo-rithms rarely make full use of feature information and fully extract the “feature-label” independent information and fusion information. Based on this, a multi-label feature extraction method based on feature-label dependence auto-encoder is proposed. The kernel extreme learning machine self-encoder is used to fuse the label space with the ori-ginal feature space and generate the reconstructed feature space. On the one hand, Hilbert-Schmidt independence cri-terion is maximized to make full use of the information between labels and the features; on the other hand, principal component analysis is used to reduce the information loss in the process of feature extraction. These?two?aspects are combined and the information of “feature-feature” and “feature-label” is extracted respectively. The comparison experi-ments on Yahoo high-dimensional multi-label datasets show that the performance of this algorithm is better than the current five main multi-label feature extraction methods, and the effectiveness of the proposed algorithm is verified.

Key words: multi-label feature extraction, feature-label dependence, kernel extreme learning machine, principal component analysis, autoencoder

摘要：

在多标记学习中，如何处理高维特征一直是研究难点之一，而特征提取算法可以有效解决数据特征高维性导致的分类性能降低问题。但目前已有的多标记特征提取算法很少充分利用特征信息并充分提取“特征-标记”独立信息及融合信息。基于此，提出一种基于特征标记依赖自编码器的多标记特征提取方法。使用核极限学习机自编码器将原标记空间与原特征空间融合并产生重构后的新特征空间。一方面最大化希尔伯特-施密特范数以充分利用标记信息；另一方面通过主成分分析来降低特征提取过程中的信息损失，结合二者并分别提取“特征-特征”和“特征-标记”信息。通过在Yahoo多组高维多标记数据集上的对比实验表明，该算法的性能优于当前五种主要的多标记特征提取方法，验证了所提算法的有效性。

关键词: 多标记特征提取, 特征标记依赖度, 核极限学习机, 主成分分析, 自编码器

CHENG Yusheng, LI Zhiwei, PANG Shufang. Multi-Label Feature Extraction Method Relied on Feature-Label Dependence Auto-encoder[J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(3): 470-481.

程玉胜，李志伟，庞淑芳. 特征标记依赖自编码器的多标记特征提取方法[J]. 计算机科学与探索, 2020, 14(3): 470-481.

[1]	WU Xiaodong, LIU Jinghao, JIN Jie, MAO Siping. DNN Intrusion Detection Model Based on DT and PCA [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(8): 1450-1458.
[2]	YANG Zhangjing, WANG Wenbo, HUANG Pu, ZHANG Fanlong. Denoising Latent Subspace Based Subspace Learning for Image Classification [J]. Journal of Frontiers of Computer Science and Technology, 2021, 15(12): 2374-2389.
[3]	SHI Na, XUE Hui, WANG Yunyun. Two-Phase Indefinite Kernel Support Vector Machine [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(4): 598-605.
[4]	YANG Jie, TANG Yachun, TAN Daojun, LIU Xiaobing. Intrusion Detection Method of Multi-channel Autoencoder Deep Learning [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(12): 2050-2060.
[5]	WANG Xiaodong, ZHAO Yining, XIAO Haili, WANG Xiaoning, CHI Xuebin. Research on Anomaly Detection System of Online Multi-node Log Flow [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(11): 1828-1837.
[6]	WAN Jing, WU Fan, HE Yunbin, LI Song. Clustering Algorithm for High-Dimensional Data Under New Dimensionality Reduc-tion Criteria [J]. Journal of Frontiers of Computer Science and Technology, 2020, 14(1): 96-107.
[7]	XIE Juanying, HOU Qi, CAO Jiawen. Image Clustering Algorithms by Deep Convolutional Autoencoders [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(4): 586-595.
[8]	YANG Shuai, HU Xuegang, ZHANG Yuhong. Multi-Marginalized Denoising Autoencoders for Domain Adaptation [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(2): 322-329.
[9]	CHEN Deyun, FU Lijun, ZHANG Xuesong, YU Liang, CHEN Hailong, LI Ao. Multiple Representations for Image Classification Approaches [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(12): 2138-2148.
[10]	LONG Tingyan, WAN Liang, DING Hongwei. Application Research of Autoencoder Network in Malicious JavaScript Code Detection [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(12): 2073-2084.
[11]	LIANG Lingyu, SUN Mingkun, HE Wei, LI Fengrong. Head Pose Estimation Method of Bagging-SVM Integrated Classifier [J]. Journal of Frontiers of Computer Science and Technology, 2019, 13(11): 1935-1944.
[12]	LIU Xiaoyan, ZHANG Chengcheng, GUO Maozu, XING Linlin. Research on Transcriptional Regulatory Network Based on Combined Model [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(7): 1154-1161.
[13]	ZHUANG Fuzhen, LUO Dan HE Qing. Ensemble Local Representation Learning Based Recommendation Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(6): 851-858.
[14]	XU Yi, DONG Qing, DAI Xin, SONG Wei. ELM Optimized Deep Autoencoder Classification Algorithm [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(5): 820-827.
[15]	YANG Zhangjing, ZHANG Fanlong, ZHANG Hui, YANG Guowei, LI Zuoyong, LUO Limin. Tri-Decomposition Model and Algorithm with Application in Image Recovery [J]. Journal of Frontiers of Computer Science and Technology, 2018, 12(12): 1940-1949.

Multi-Label Feature Extraction Method Relied on Feature-Label Dependence Auto-encoder

特征标记依赖自编码器的多标记特征提取方法

PDF

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 15

Recommended Articles

Metrics