首页 > 论文 > 光学学报 > 34卷 > 9期(pp:0930001--1)

基于潜在语义分析与NIR的中药材分类研究

Classification Research of Chinese Medicine Based on Latent Semantic Analysis and NIR

  • 摘要
  • 论文信息
  • 参考文献
  • 被引情况
  • PDF全文
分享:

摘要

基于近红外光谱(NIR)和潜在语义分析(LSA)方法, 对5种典型壮阳中药材进行分类鉴别研究。利用潜在语义分析对光谱预处理后的5种壮阳中药材光谱数据进行特征提取和鉴别分类后, 将经光谱预处理和主成分分析(PCA)提取特征后的光谱特征数据分别带入K近邻(KNN)、BP神经网络(BP-ANN)和偏最小二乘支持向量机(LSSVM)三种典型的分类模型进行分类, 并将结果与潜在语义分析模型结果进行对比。 在4119.20~9881.46 cm-1波数范围内, NIR光谱数据经多元散射校正(MSC)预处理后, 代入潜在语言空间维数为3时所建立的LSA分类模型, 训练集和测试集准确率均达到了100%。 结果表明, 在壮阳类中药材的近红外光谱分析鉴别中, 潜在语义分析可以作为一种全新的提取光谱信息并分类的方法, 具有较好的运用前景和实际意义。

Abstract

Five kinds of typical Yang-boosting Chinese herbal medicine are identified and classified based on near infrared spectroscopy (NIR) and latent semantic analysis (LSA) methods. Latent semantic analysis is used for characteristic extraction and classification of preprocessed spectral data of 5 kinds of Yang-boosting Chinese herbal medicine. The spectral characteristic data, after spectral pretreating and characteristic extraction by principal component analysis (PCA), are respectively subjected into the K-nearest neighbor (KNN), BP-artifical neural networks (BP-ANN) and least squares support vector machine (LS-SVM) classification models whose results then are compared with the result of latent semantic analysis model. In the characteristic wavenumber range of 4119.20~9881.46 cm-1, spectral data pretreated by multiplicative scatter correction (MSC) are substituted to LSA classification model when spacing dimension of underlying language is 3, and accuracy rates of both training set and test set are 100%. The results show that latent semantic analysis, which has a good application prospect and practical significance, can be used as a new method for spectral information extraction and classification in the near-infrared spectroscopy identification of Yang-boosting Chinese herbal medicine.

Newport宣传-MKS新实验室计划
补充资料

中图分类号:O657.3

DOI:10.3788/aos201434.0930001

所属栏目:光谱学

基金项目:国家自然科学基金(61007058)、中央高校基本科研业务费专项基金(2014JC001)

收稿日期:2014-04-01

修改稿日期:2014-05-07

网络出版日期:--

作者单位    点击查看

陈晓峰:华中农业大学工学院, 湖北 武汉 430070
龙长江:华中农业大学工学院, 湖北 武汉 430070
牛智有:华中农业大学工学院, 湖北 武汉 430070
朱凯:华中农业大学工学院, 湖北 武汉 430070

联系人作者:龙长江(lcjflow@163.com)

备注:陈晓峰(1988-), 男, 硕士研究生, 主要从事近红外检测方面的研究。E-mail: charmy_cxf@163.com

【1】Li Jingwei. Dictionary of Chinese Medicine [M]. Beijing: People′s Medical Publishing House, 1995. 712-712.
李经纬. 中医大辞典[M]. 北京: 人民卫生出版社, 1995. 712-712.

【2】Hu Yongchuan, Tian Xiaoxin, Liu Lei, et al.. Advances in identification of Chinese medicines by NIRS [J]. China Journal of Chinese Materia Medica, 2012, 37(8): 1066-1071.
胡咏川, 田晓鑫, 刘蕾, 等. 近红外光谱技术鉴定中药的进展[J]. 中国中药杂志, 2012, 37(8): 1066-1071.

【3】Gao Hongbin, Liu Hao, Xiang Bingren. Rapid and nondestructive identification of pinellia rhizome and its pseudo-product arisaema rhizome by near-infrared diffuse reflectance spectrometry [J]. Chinese Journal of Spectroscopy Laboratary, 2012, 29(2): 899-902.
高鸿彬, 刘浩, 相秉仁. 半夏及其伪品天南星的近红外漫反射快速无损鉴别[J]. 光谱实验室, 2012, 29(2): 899-902.

【4】Gong Haiyan, Bai Yan, Song Ruili, et al.. The discrimination of tigun yam and baiyu yam using near infrared spectroscopy [J]. Chinese Journal of Hospital Pharmacy, 2010, 30(9): 735-737.
龚海燕, 白雁, 宋瑞丽, 等. 近红外光谱结合聚类分析鉴别铁棍山药和白玉山药[J]. 中国医院药学杂志, 2010, 30(9): 735-737.

【5】Du Min, Gong Yin, Lin Zhaozhou, et al.. Rapid identification of wolf berry fruit of different geographic regions with sample surface near infrared spectra combined with multi-class SVM [J]. Spectroscopy and Spectral Analysis, 2013, 33(5): 1211-1214.
杜敏, 巩颖, 林兆洲, 等. 样品表面近红外光谱结合多类支持向量机快速鉴别枸杞子产地[J]. 光谱学与光谱分析, 2013, 33(5): 1211-1214.

【6】Zhang Jing, Geng Zhipeng, Fan Gang, et al.. A new method for analysis of six akaloids in coptis chinensis franch by near infrared diffuse reflectance spectroscopy [J]. Lishizhen Medicine and Materia Medica Research, 2011, 22(10): 2393-2394.
张静, 耿志鹏, 范刚, 等. 近红外光谱技术测定黄连中6种生物碱含量的新方法[J]. 时珍国医国药, 2011, 22(10): 2393-2394.

【7】Jiang Jianhong, Luo Mei. Latent semantic information extraction and classification of online product [J]. Computer & Digital Engineering, 2014, 42(1): 112-115.
蒋建洪, 罗玫. 在线商品的潜在语义信息提取及分类研究[J]. 计算机与数字工程, 2014, 42(1): 112-115.

【8】Liu Bo. Application of Latent Semantic Indexing Chinese Information Retrieve [D]. Beijing: Beijing University of Posts and Telecommunications, 2009. 10-17.
刘博. 潜在语义索引在中文信息检索中的应用[D]. 北京: 北京邮电大学, 2009. 10-17.

【9】Jian Yan. Chinese Text Clustering Based on Latent Semantic and Its Applications [D]. Shenyang: Northeastern University, 2008, 5-12.
简艳. 基于潜在语义的中文文本聚类及其应用[D]. 沈阳: 东北大学, 2008. 5-12.

【10】龙长江, 万鹏. 近红外检测技术在中药研究中的应用[C]. 中国农业工程学会2011年学术年会论文集, 2011. 1704-1707.

【11】Chen Jiehua. Theory and Application of Latent Semantic Analysis [D]. Shanghai: Shanghai University, 2005. 12-21.
陈洁华. 潜在语义分析理论研究及其应用[D]. 上海: 上海大学, 2005. 12-21.

【12】Guo Peiyuan, Lin Yan, Fu Yan, et al.. Research on freshness level of meat based on near-infrared spectroscopic technique [J]. Laser & Optoelectronics Progress, 2013, 50(3): 033002.
郭培源, 林岩, 付妍, 等. 基于近红外光谱技术的猪肉新鲜度等级研究[J]. 激光与光电子学进展, 2013, 50(3): 033002.

【13】Zhao Jiewen, Jiang Pei, Chen Quansheng. Discrimination of snow lotus from different geographical origins by near infrared spectroscopy [J]. Transactions of the Chinese Society for Agricultural Machinery, 2010, 41(8): 111-114.
赵杰文, 蒋培, 陈全胜. 雪莲花产地鉴别的近红外光谱分析方法[J]. 农业机械学报, 2010, 41(8): 111-114.

【14】Niu Xiaoying, Shao Limin, Zhao Zhilei, et al.. Nondestructive discrimination of strawberry varieties by NIR and BP-ANN [J]. Spectroscopy and Spectral Analysis, 2012, 32(8): 2095-2099.
牛晓颖, 邵利敏, 赵志磊, 等. 基于BP-ANN的草莓品种近红外光谱无损鉴别方法研究[J]. 光谱学与光谱分析, 2012, 32(8): 2095-2099.

【15】Jiang Shiquan, Zhou Xingcai, Jiang Shiping. Discriminative model for FTNIS analysis on age of Shaoxing rice wine based on PCA and LS-SVM [J]. Chinese Journal of Spectroscopy Laboratory, 2012, 29(2): 806-811.
蒋诗泉, 周兴才, 蒋诗平. 基于PCA和LS-SVM的傅里叶变换近红外光谱的黄酒酒龄的鉴别模型[J]. 光谱实验室, 2012, 29(2): 806-811.

【16】Shao Yongni, Cao Fang, He Yong. Discrimination years of rough rice by using visible/near infrared spectroscopy based on independent component analysis and BP neural network [J]. Journal of Infrared and Millimeter Waves, 2007, 26(6): 433-436.
邵咏妮, 曹芳, 何勇. 基于独立组分分析和BP神经网络的可见/近红外光谱稻谷年份的鉴别[J]. 红外与毫米波学报, 2007, 26(6): 433-436.

【17】Q Chen, J Zhao, H Lin. Study on discrimination of Roast green tea (Camellia simensis L.) according to geographical origin by FT-NIR spectroscopy and supervised pattern recognition [J]. Spectrochim Acta Part (A), 2009, 72(4): 845-850.

您的浏览器不支持PDF插件,请使用最新的(Chrome/Fire Fox等)浏览器.或者您还可以点击此处下载该论文PDF