光学 精密工程, 2014, 22 (7): 1921, 网络出版: 2014-09-01  

基于监督保局子空间虚假近邻准则的原始特征选择

Original feature selection based on false nearest neighbor criterion in supervised locality preserving subspace
作者单位
1 重庆科技学院 电气与信息工程学院, 重庆 401331
2 重庆大学 光电技术及系统教育部重点实验室, 重庆 400044
摘要
提出一种基于监督保局投影(SLPP)与虚假最近邻(FNN)准则的原始特征选择方法。该方法首先将非线性原始数据映射到监督保局子空间, 消除样本数据输入变量之间的相关性; 然后, 利用虚假近邻点方法计算剔除每个原始特征前后输入样本在监督保局子空间里的相似性测度, 获得每个原始特征对类别变量不同程度的解释力; 最后, 从全特征开始逐步剔除解释能力弱的特征进而获得多组特征子集, 并建立最近邻分类器, 识别率最高且含特征数最少的特征子集即为最优特征子集。采用合成数据对该方法进行了仿真验证, 结果表明, 该方法可获得与数据集本质分类特征吻合的最佳特征子集。将该方法应用于选择真实的低阻油气层特征, 获得的最佳特征子集比全特征集合的特征数量减少了50%以上, 分类识别率高出8%。结果显示该方法具有优秀的原始特征选择能力, 是一种有效的非线性特征选择方法。
Abstract
A novel method based on Supervised Locality Preserving Projection (SLPP) and False Nearest Neighbor (FNN) was proposed for selecting the most proper feature for nonlinear pattern classification.In the proposed method, nonlinear original data were mapped to the supervised locality preserving subspace to eliminate the existing multi-collinearity among the features. Then, the interpretation capability for original features was estimated through calculating the variable mapping distance in the supervised locality preserving subspace. The nearest neighbor classifier based on each subset obtained by eliminating weak features successively was constructed. Finally, the optimal feature subset was selected corresponding to the highest recognition accuracy and the least number of features.The experiment on synthetic dataset shows that the proposed method can obtain an optimal feature subset containing the essential features in accordance with the classification goal. The method was used to select the features of low resistivity hydrocarbon reservoir, and the result indicates that the obtained optimal feature subset contains over 50% less feature and achieves 8% higher recognition accuracy as compared to that of the all-feature set. These results validate that the proposed method can offer excellent abilities of original feature selection and nonlinear feature selection.
参考文献

[1] JUNG C S, SEO H, KANG H G. Estimating redundancy information of selected features in multi-dimensional pattern classification[J].Pattern Recognition Letters, 2011, 32(4): 590-596.

[2] 王灿进, 孙涛, 王挺峰, 等. 基于轮廓特征的神经网络目标识别研究[J]. 液晶与显示, 2013,28(4):641-648.

    WANG C J,SUN T,WANG T F,et al..Target recognition using Neural Network based on Contour features [J].Chinese Journal of Liquid Crystals and Displays,2013,28(4):641-648.(in Chinese)

[3] 杨云, 岳柱.基于融合图像轮廓和Harris角点方法的遮挡人体目标识别研究[J].液晶与显示,2013,28(2):273-277.

    YANG Y,YUE ZH. Human body target recognition under occlusion based on fusion of image Contour moment and Harris angular points[J]. Chinese Journal of Liquid Crystals and Displays,2013,28(2):273-277.

[4] CHO H W.Nonlinear feature extraction and classification of multivariate data in Kernel feature space [J].Expert Systems with Applications, 2007, 32(2): 534-542.

[5] ZHANG Y W.Enhanced statistical analysis of nonlinear processes using KPCA, KICA and SVM[J].Chemical Engineering Science, 2009, 64(5):801-811.

[6] WIDODO A, KIM E Y, SON J D, et al.. Fault diagnosis of low speed bearing based on relevance vector machine and support vector machine[J].Expert Systems with Applications, 2009, 36(3): 7252-7261.

[7] SAKAR C O, KURSUN O, GURGEN F. A feature selection method based on kernel canonical correlation analysis and the minimum Redundancy-Maximum Relevance filter method[J].Expert Systems with Applications, 2012, 39(3): 3432-3437.

[8] ZHANG Y W, TENG Y D. Process data modeling using modified kernel partial least squares[J].Chemical Engineering Science, 2010, 65(24): 6353-6361.

[9] 杨静宇, 金钟, 杨健. 模式特征抽取研究进展[C]. 2009中国自动化大会暨两化融合高峰会议, 2009, 杭州.

    YANG J Y, JIN ZH, YANG J. The research progress of pattern feature extraction[C]. CAAC2009, 2009, Hang Zhou.(in Chinese)

[10] TENENBAUM J B, VIN de SILVA, LANGFORD J C. A global geometric framework for nonlinear dimensionality reduction [J].Science, 2000, 290: 2319-2323.

[11] ROWEIS S T, SAUL L K. Nonlinear dimensi-onality reduction by locally linear embedding [J].Science, 2000, 290: 2323-2326.

[12] BELKIN M, NIYOGI P.Laplacian eigenmaps and spectral techniques for embedding and clustering[C]. Proceedings of Conference on Advances in Neural Information Processing system 15, 2001.

[13] PARKH. ISOMAP induced manifold embedding and its application to Alzheimers disease and mild cognitive impairment[J].Neuroscience Letters, 2012, 513(2): 141-145.

[14] QIAO H, ZHANG P, WANG D. An explicit nonlinear mapping for manifold learning[J].IEEE Transactions on Cybernetics, 2013, 43(1): 51-63.

[15] ZHANG J P J, HUANG H, WANG J. Manifold learning for visualizing and analyzing high-dimensional data [J]. IEEE Intelligent Systems, 2010, 25(4): 54-61.

[16] 黄鸿, 杨媚, 张满菊. 基于稀疏鉴别嵌入的高光谱遥感影像分类[J].光学精密工程, 2013, 21(11):2922-2930.

    HUANG H, YANG M, ZHANG M J. Semi-supervised manifold learning and its application to remote sensing image classification [J].Opt. Precision Eng., 2013, 21(11):2922-2930. (in Chinese)

[17] KENNEL M B, BROWN R, ABARBANEL H D I. Determining embedding dimension for phase space reconstruction using a geometrical reconstruction[J].Phys Rev A, 1992, 45:3403-3411.

[18] 李太福, 易军, 苏莹莹, 等. 基于KPCA子空间虚假邻点判别的非线性建模的变量选择[J]. 机械工程学报, 2012, 48(10):192-198.

    LI T F, YI J, SU Y Y, et al..Variable selection for nonlinear modeling based on false nearest neighbours in KPCA subspace[J].Journal of Mechanical Engineering, 2012, 48(10):192-198. (in Chinese)

[19] HE X F, YAN S C, HU Y X, et al.. Face recognition using Laplacianfaces [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(3): 328-340.

[20] 辜小花, 龚卫国, 杨利平.有监督图优化保局投影[J].光学精密工程, 2011, 19(3): 672-680.

    GU X H, GONG W G, YANG L P.Supervised graph-optimized locality preserving projections [J].Opt. Precision Eng., 2011, 19(3):672-680.(in Chinese)

辜小花, 李太福, 杨利平, 易军, 周伟. 基于监督保局子空间虚假近邻准则的原始特征选择[J]. 光学 精密工程, 2014, 22(7): 1921. GU Xiao-hua, LI Tai-fu, YANG Li-ping, YI Jun, ZHOU Wei. Original feature selection based on false nearest neighbor criterion in supervised locality preserving subspace[J]. Optics and Precision Engineering, 2014, 22(7): 1921.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!