光谱学与光谱分析, 2019, 39 (5): 1428, 网络出版: 2019-05-13   

窗口竞争性自适应重加权采样策略的近红外特征变量选择方法

A Variable Selection Approach of Near Infrared Spectra Based on Window Competitive Adaptive Reweighted Sampling Strategy
作者单位
1 湖南农业大学食品科学技术学院食品科学与生物技术湖南省重点实验室, 湖南 长沙 410128
2 上海烟草集团有限责任公司技术中心北京工作站, 北京 101121
摘要
通过消除光谱中的冗余信息变量, 挑选出代表样品性质的特征变量代替全谱建立定量模型, 可以提高近红外分析结果的准确性。 基于进化论中适者生存原理的竞争性自适应重加权采样(CARS)算法因具有计算速度快、 筛选得到的特征波长少等优点, 在近红外特征变量筛选方面得到了广泛的应用。 然而该方法在计算过程中容易出现校正集和验证集结果不一致情况。 这是因为算法过于强调校正集交叉验证结果, 且并未考虑相邻变量之间的协同作用。 为了建立更加稳健的变量筛选方法, 通过结合“窗口”以及CARS算法的优势, 提出了一种基于窗口竞争性自适应重加权采样(WCARS)策略的近红外特征变量筛选方法, 并将其应用于复杂植物样品近红外光谱与其化学成分含量之间的建模分析。 采用WCARS方法可以实现准确定量分析, 且通过与竞争性自适应重加权采样(CARS)方法结果相比较, WCARS方法得到的校正集和预测集结果一致, 在一定程度上减少了过拟合问题的出现。 该策略能有效增强特征变量选择的稳健性, 提高了定量模型的可信度, 具有一定的应用价值。
Abstract
Variable selection plays an important role in the quantitative analysis of near infrared spectra. The accuracy of near infrared spectroscopy can be improved by eliminating the redundant variables and selecting the characteristic variables. Competitive adaptive reweighted sampling (CARS) method is a newly developed strategy for wavelength selection by employing the principle “survival of the fittest” on which Darwin’s Evolution Theory is based. The number of selected wavelengths by CARS is much smaller than those of other methods with fast calculating speed and high accuracy. However, it is easy to get inconsistent results between the calibration and validation set due to the excessive attention on the cross validation results. In order to develop a robust variable selection method, by combining the advantages of CARS and “window”, a new tactic called window competitive adaptive reweighted sampling (WCARS) is employed to select characteristic variables and applied to the analysis of the near infrared spectra of the complex plant samples and the contents of the chemical components. Compared with the results of CARS method, accurate quantitative results can be obtained by the WCARS method. Furthermore, the results of correction set are consistent with those of the prediction set, and the problem of overfitting can be avoided. The results show that WCARS tactic can efficiently improve the accuracy and stability of variables selection and optimize the precision of prediction model, which has a certain application value.

李跑, 周骏, 蒋立文, 刘霞, 杜国荣. 窗口竞争性自适应重加权采样策略的近红外特征变量选择方法[J]. 光谱学与光谱分析, 2019, 39(5): 1428. LI Pao, ZHOU Jun, JIANG Li-wen, LIU Xia, DU Guo-rong. A Variable Selection Approach of Near Infrared Spectra Based on Window Competitive Adaptive Reweighted Sampling Strategy[J]. Spectroscopy and Spectral Analysis, 2019, 39(5): 1428.

本文已被 2 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!