激光与光电子学进展, 2021, 58 (3): 0330001, 网络出版: 2021-03-12  

基于混合机器学习法的太赫兹波鉴别草种的研究 下载: 582次

Identification of a Grass Species Using a Terahertz Wave Based on Hybrid Machine Learning Method
作者单位
1 中国石油大学(北京)理学院,北京 102249
2 内蒙古自治区草原工作站,内蒙古 呼和浩特 010020
摘要
利用太赫兹时域光谱技术对黄耆类牧草种子样品进行测试,得到5种常见沙打旺牧草种子在0.2~1.2 THz有效频率范围内的太赫兹时域谱,然后通过快速傅里叶变换得到了各牧草种子样品的吸收系数、折射率等光学参数。研究后发现:在有效频率范围内,样品时域谱的峰值强度和延迟时间均不同,且每条谱线的平均吸收系数和标准差也有明显差异,各样品的平均折射率也有较大差异。同时,本文提出了一种将主成分分析(PCA)与随机森林(RF)机器学习算法相结合的优化实验数据的混合模型PCA-RF,并基于太赫兹折射率谱,采用PCA-RF模型和RF模型对5种牧草种子的200个数据集进行了统计计算。结果表明:混合模型PCA-RF的平均分类准确率为91.20%;与RF模型相比,不管是总的平均分类准确率,还是每种样品的分类准确率,PCA-RF模型都优于RF模型。研究结果表明,太赫兹时域光谱技术结合混合机器学习算法的PCA-RF模型是一种无损鉴定牧草种子真伪的有效手段,可用于鉴别同族且差异较小的牧草品种。
Abstract
In this study, the terahertz time-domain spectroscopy (THz-TDS) technology was used to conduct experimental tests on seed samples of astragalus japonica. We obtained the terahertz time-domain spectra of five kinds of Astragalus adsurgens Pall. seeds in the effective frequency range of 0.2-1.2 THz, and used the fast Fourier transform analysis to study the optical parameters such as the absorption coefficient and refractive index of each grass-seed sample. It was found that in the effective frequency range, the peak intensity and delay time of the time-domain spectrum of the samples were different, and the average absorption coefficient and standard deviation of each spectrum line were significantly different. In addition, the average refractive index of the samples was significantly different. At the same time, this study proposes a hybrid model of optimized experimental data that combines principal component analysis (PCA) with random forest machine learning (RF). Based on the terahertz refractive index spectrum, 200 datasets of five forage species were statistically calculated, and the calculated results were compared with the calculated results of the RF model. The results show that the average classification accuracy of principal component analysis-random forest (PCA-RF) in the mixed model is 91.20%. Compared with the RF model, both total average classification accuracy and the classification accuracy of each sample of the PCA-RF model are better than those of the RF model. The study shows that the PCA-RF model combining THz-TDS with the hybrid machine learning algorithm can be used as an effective method for the nondestructive identification of the authenticity of forage grass seeds. In particular, it can be used for the classification of forage grass varieties of the same family with little difference.

王芳, 张春红, 赵景峰, 哈斯巴特尔, 张玉. 基于混合机器学习法的太赫兹波鉴别草种的研究[J]. 激光与光电子学进展, 2021, 58(3): 0330001. Wang Fang, Zhang Chunhong, Zhao Jingfeng, Ha Sibateer, Zhang Yu. Identification of a Grass Species Using a Terahertz Wave Based on Hybrid Machine Learning Method[J]. Laser & Optoelectronics Progress, 2021, 58(3): 0330001.

引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!