光学学报, 2014, 34 (9): 0930001, 网络出版: 2020-05-22   

基于潜在语义分析与NIR的中药材分类研究

Classification Research of Chinese Medicine Based on Latent Semantic Analysis and NIR
作者单位
华中农业大学工学院, 湖北 武汉 430070
摘要
基于近红外光谱(NIR)和潜在语义分析(LSA)方法, 对5种典型壮阳中药材进行分类鉴别研究。利用潜在语义分析对光谱预处理后的5种壮阳中药材光谱数据进行特征提取和鉴别分类后, 将经光谱预处理和主成分分析(PCA)提取特征后的光谱特征数据分别带入K近邻(KNN)、BP神经网络(BP-ANN)和偏最小二乘支持向量机(LSSVM)三种典型的分类模型进行分类, 并将结果与潜在语义分析模型结果进行对比。 在4119.20~9881.46 cm-1波数范围内, NIR光谱数据经多元散射校正(MSC)预处理后, 代入潜在语言空间维数为3时所建立的LSA分类模型, 训练集和测试集准确率均达到了100%。 结果表明, 在壮阳类中药材的近红外光谱分析鉴别中, 潜在语义分析可以作为一种全新的提取光谱信息并分类的方法, 具有较好的运用前景和实际意义。
Abstract
Five kinds of typical Yang-boosting Chinese herbal medicine are identified and classified based on near infrared spectroscopy (NIR) and latent semantic analysis (LSA) methods. Latent semantic analysis is used for characteristic extraction and classification of preprocessed spectral data of 5 kinds of Yang-boosting Chinese herbal medicine. The spectral characteristic data, after spectral pretreating and characteristic extraction by principal component analysis (PCA), are respectively subjected into the K-nearest neighbor (KNN), BP-artifical neural networks (BP-ANN) and least squares support vector machine (LS-SVM) classification models whose results then are compared with the result of latent semantic analysis model. In the characteristic wavenumber range of 4119.20~9881.46 cm-1, spectral data pretreated by multiplicative scatter correction (MSC) are substituted to LSA classification model when spacing dimension of underlying language is 3, and accuracy rates of both training set and test set are 100%. The results show that latent semantic analysis, which has a good application prospect and practical significance, can be used as a new method for spectral information extraction and classification in the near-infrared spectroscopy identification of Yang-boosting Chinese herbal medicine.
参考文献

[1] 李经纬. 中医大辞典[M]. 北京: 人民卫生出版社, 1995. 712-712.

    Li Jingwei. Dictionary of Chinese Medicine [M]. Beijing: People′s Medical Publishing House, 1995. 712-712.

[2] 胡咏川, 田晓鑫, 刘蕾, 等. 近红外光谱技术鉴定中药的进展[J]. 中国中药杂志, 2012, 37(8): 1066-1071.

    Hu Yongchuan, Tian Xiaoxin, Liu Lei, et al.. Advances in identification of Chinese medicines by NIRS [J]. China Journal of Chinese Materia Medica, 2012, 37(8): 1066-1071.

[3] 高鸿彬, 刘浩, 相秉仁. 半夏及其伪品天南星的近红外漫反射快速无损鉴别[J]. 光谱实验室, 2012, 29(2): 899-902.

    Gao Hongbin, Liu Hao, Xiang Bingren. Rapid and nondestructive identification of pinellia rhizome and its pseudo-product arisaema rhizome by near-infrared diffuse reflectance spectrometry [J]. Chinese Journal of Spectroscopy Laboratary, 2012, 29(2): 899-902.

[4] 龚海燕, 白雁, 宋瑞丽, 等. 近红外光谱结合聚类分析鉴别铁棍山药和白玉山药[J]. 中国医院药学杂志, 2010, 30(9): 735-737.

    Gong Haiyan, Bai Yan, Song Ruili, et al.. The discrimination of tigun yam and baiyu yam using near infrared spectroscopy [J]. Chinese Journal of Hospital Pharmacy, 2010, 30(9): 735-737.

[5] 杜敏, 巩颖, 林兆洲, 等. 样品表面近红外光谱结合多类支持向量机快速鉴别枸杞子产地[J]. 光谱学与光谱分析, 2013, 33(5): 1211-1214.

    Du Min, Gong Yin, Lin Zhaozhou, et al.. Rapid identification of wolf berry fruit of different geographic regions with sample surface near infrared spectra combined with multi-class SVM [J]. Spectroscopy and Spectral Analysis, 2013, 33(5): 1211-1214.

[6] 张静, 耿志鹏, 范刚, 等. 近红外光谱技术测定黄连中6种生物碱含量的新方法[J]. 时珍国医国药, 2011, 22(10): 2393-2394.

    Zhang Jing, Geng Zhipeng, Fan Gang, et al.. A new method for analysis of six akaloids in coptis chinensis franch by near infrared diffuse reflectance spectroscopy [J]. Lishizhen Medicine and Materia Medica Research, 2011, 22(10): 2393-2394.

[7] 蒋建洪, 罗玫. 在线商品的潜在语义信息提取及分类研究[J]. 计算机与数字工程, 2014, 42(1): 112-115.

    Jiang Jianhong, Luo Mei. Latent semantic information extraction and classification of online product [J]. Computer & Digital Engineering, 2014, 42(1): 112-115.

[8] 刘博. 潜在语义索引在中文信息检索中的应用[D]. 北京: 北京邮电大学, 2009. 10-17.

    Liu Bo. Application of Latent Semantic Indexing Chinese Information Retrieve [D]. Beijing: Beijing University of Posts and Telecommunications, 2009. 10-17.

[9] 简艳. 基于潜在语义的中文文本聚类及其应用[D]. 沈阳: 东北大学, 2008. 5-12.

    Jian Yan. Chinese Text Clustering Based on Latent Semantic and Its Applications [D]. Shenyang: Northeastern University, 2008, 5-12.

[10] 龙长江, 万鹏. 近红外检测技术在中药研究中的应用[C]. 中国农业工程学会2011年学术年会论文集, 2011. 1704-1707.

[11] 陈洁华. 潜在语义分析理论研究及其应用[D]. 上海: 上海大学, 2005. 12-21.

    Chen Jiehua. Theory and Application of Latent Semantic Analysis [D]. Shanghai: Shanghai University, 2005. 12-21.

[12] 郭培源, 林岩, 付妍, 等. 基于近红外光谱技术的猪肉新鲜度等级研究[J]. 激光与光电子学进展, 2013, 50(3): 033002.

    Guo Peiyuan, Lin Yan, Fu Yan, et al.. Research on freshness level of meat based on near-infrared spectroscopic technique [J]. Laser & Optoelectronics Progress, 2013, 50(3): 033002.

[13] 赵杰文, 蒋培, 陈全胜. 雪莲花产地鉴别的近红外光谱分析方法[J]. 农业机械学报, 2010, 41(8): 111-114.

    Zhao Jiewen, Jiang Pei, Chen Quansheng. Discrimination of snow lotus from different geographical origins by near infrared spectroscopy [J]. Transactions of the Chinese Society for Agricultural Machinery, 2010, 41(8): 111-114.

[14] 牛晓颖, 邵利敏, 赵志磊, 等. 基于BP-ANN的草莓品种近红外光谱无损鉴别方法研究[J]. 光谱学与光谱分析, 2012, 32(8): 2095-2099.

    Niu Xiaoying, Shao Limin, Zhao Zhilei, et al.. Nondestructive discrimination of strawberry varieties by NIR and BP-ANN [J]. Spectroscopy and Spectral Analysis, 2012, 32(8): 2095-2099.

[15] 蒋诗泉, 周兴才, 蒋诗平. 基于PCA和LS-SVM的傅里叶变换近红外光谱的黄酒酒龄的鉴别模型[J]. 光谱实验室, 2012, 29(2): 806-811.

    Jiang Shiquan, Zhou Xingcai, Jiang Shiping. Discriminative model for FTNIS analysis on age of Shaoxing rice wine based on PCA and LS-SVM [J]. Chinese Journal of Spectroscopy Laboratory, 2012, 29(2): 806-811.

[16] 邵咏妮, 曹芳, 何勇. 基于独立组分分析和BP神经网络的可见/近红外光谱稻谷年份的鉴别[J]. 红外与毫米波学报, 2007, 26(6): 433-436.

    Shao Yongni, Cao Fang, He Yong. Discrimination years of rough rice by using visible/near infrared spectroscopy based on independent component analysis and BP neural network [J]. Journal of Infrared and Millimeter Waves, 2007, 26(6): 433-436.

[17] Q Chen, J Zhao, H Lin. Study on discrimination of Roast green tea (Camellia simensis L.) according to geographical origin by FT-NIR spectroscopy and supervised pattern recognition [J]. Spectrochim Acta Part (A), 2009, 72(4): 845-850.

陈晓峰, 龙长江, 牛智有, 朱凯. 基于潜在语义分析与NIR的中药材分类研究[J]. 光学学报, 2014, 34(9): 0930001. Chen Xiaofeng, Long Changjiang, Niu Zhiyou, Zhu Kai. Classification Research of Chinese Medicine Based on Latent Semantic Analysis and NIR[J]. Acta Optica Sinica, 2014, 34(9): 0930001.

本文已被 5 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!