Computer Integrated Manufacturing System ›› 2023, Vol. 29 ›› Issue (11): 3639-3655.DOI: 10.13196/j.cims.2021.0408

Previous Articles     Next Articles

Regularized LFDA algorithm based on density peak clustering

TAO Xinmin,WU Yongkang,BAO Yixuan,QI Lin,CHEN Wei,FAN Zhiting,HUANG Shan   

  1. College of Engineering and Technology,Northeast Forestry University
  • Online:2023-11-30 Published:2023-12-04
  • Supported by:
    Project supported by the National Natural Science Foundation,China (No.62176050),the Innovative Talent Fund of Harbin Science and Technology Bureau,China (No.2017RAXXJ018),and the Double First-Class Scientific Research Foundation of Northeast Forestry University,China (No.411112438).

基于密度峰值聚类的正则化LFDA算法

陶新民,吴永康,包艺璇,祁霖,陈玮,范芷汀,黄珊   

  1. 东北林业大学工程技术学院
  • 基金资助:
    国家自然科学基金资助项目(62176050);哈尔滨科技局创新人才基金资助项目(2017RAXXJ018);东北林业大学双一流科研启动资金资助项目(411112438)。

Abstract: Considering that the existing Fisher Discriminant Analysis (FDA) and its improved algorithms cannot effectively use both labeled and unlabeled data for learning,a Regularized LFDA algorithm based on Density Peak Clustering (DPC-RLFDA) was proposed.Two regularization terms were constructed using pseudo-labels derived from the density peak clustering technique to regularize the within-class scatter matrix and between-class scatter matrix of local FDA,and then the optimal projection vector was obtained by solving the objective function.In addition,to accommodate for non-linear and non-Gaussian distributed datasets,a kernel-based DPC-RLFDA was proposed.The experimental results on artificial datasets and UCI datasets showed that compared with the FDA and its improved algorithm,the discriminant performance of the proposed algorithm was significantly improved.

Key words: dimension reduction, feature extraction, Fisher discriminant analysis, density peak clustering

摘要: 考虑到现有费舍尔判别分析(FDA)及其改进算法无法同时有效利用有标签数据和无标签数据进行学习,提出一种基于密度峰值聚类的正则化局部费舍尔判别分析(DPC-RLFDA)算法。该算法首先利用密度峰值聚类算法得到的伪标签构造两个正则化项来规范局部FDA的类间散度矩阵和类内散度矩阵;然后通过求解目标函数得到最优投影向量。此外,为适用于非线性非高斯分布数据集,提出了基于核的DPC-RLFDA。在人工数据集和UCI数据集上的实验结果表明,与FDA及其改进算法相比,所提算法的判别性能得到了显著提升。

关键词: 降维, 特征提取, 费舍尔判别分析, 密度峰值聚类

CLC Number: