Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2010, 30 (5): 1214, published online: 2011-01-26

Study on the Application of Ridge Regression to Near-Infrared Spectroscopy Quantitative Analysis and Optimum Wavelength Selection
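Author affiliations
1 College of Science, China Agricultural University, Beijing 100083
2 College of Information and Electrical Engineering, China Agricultural University, Beijing 100083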
Abstract
Using 66 wheat samples as test material, this paper studies the application of ridge regression to near-infrared (NIR) spectroscopy quantitative analysis. An NIR-ridge regression model for determining protein content was built from the NIR spectral data of 44 wheat samples and used to predict the protein content of the remaining 22 samples. The average relative error between the predicted values and the Kjeldahl values (chemical analysis values) was 1.518%. Comparison with predictions from the partial least squares (PLS) method showed that ridge regression is suitable for NIR quantitative analysis. Furthermore, screening the wavelength information is an effective way to reduce the interference of irrelevant information with the model's predictive ability. To improve predictive accuracy, ridge regression was used to select the spectral information that is richest in content and most strongly correlated with the composition or properties of the samples. Four wavelength points were selected from 1297, and an NIR-ridge regression model built on the spectral information at these 4 points was used to predict the protein content of the 22 samples. The average relative error between the predicted values and the Kjeldahl values was 1.37%, and the correlation coefficient reached 0.9817. The results show that ridge regression can screen the most important wavelength information out of a large amount of spectral information. This not only simplifies the model and effectively reduces the interference from collinearity in the spectral information; selecting wavelengths suited to a specific analysis also has practical value for guiding the design of dedicated NIR quantitative analysis instruments.
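The workflow summarized in the abstract (fit a ridge model on 44 calibration samples, predict 22 held-out samples, then report the average relative error and the correlation coefficient) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and not the authors' implementation: the spectra are synthetic stand-ins generated from three latent factors, the penalty `lam = 1.0` is arbitrary, and the function names `ridge_fit`, `mean_relative_error`, and `correlation` are hypothetical. The estimator used is the standard closed form $\hat{\beta} = (X^{T}X + \lambda I)^{-1}X^{T}y$ on centered data.

```python
import numpy as np


def ridge_fit(X, y, lam):
    """Closed-form ridge estimate on centered data; the intercept is not penalized."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    p = X.shape[1]
    # Solve (Xc'Xc + lam*I) beta = Xc'yc instead of forming an explicit inverse.
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(p), Xc.T @ yc)
    intercept = y_mean - x_mean @ beta
    return beta, intercept


def mean_relative_error(y_true, y_pred):
    """Average of |prediction - reference| / |reference|."""
    return float(np.mean(np.abs(y_pred - y_true) / np.abs(y_true)))


def correlation(y_true, y_pred):
    """Pearson correlation between reference and predicted values."""
    return float(np.corrcoef(y_true, y_pred)[0, 1])


# Synthetic stand-in data (assumption): 66 samples, 200 strongly collinear
# "wavelength" columns driven by 3 latent factors. Not real wheat spectra.
rng = np.random.default_rng(0)
n_samples, n_wavelengths, n_factors = 66, 200, 3
scores = rng.normal(size=(n_samples, n_factors))
loadings = rng.normal(size=(n_factors, n_wavelengths))
X = scores @ loadings + 0.01 * rng.normal(size=(n_samples, n_wavelengths))
y = 12.0 + scores @ np.array([1.0, -0.5, 0.3]) + 0.05 * rng.normal(size=n_samples)

# 44 calibration samples and 22 prediction samples, mirroring the split in the abstract.
X_cal, y_cal = X[:44], y[:44]
X_pred, y_ref = X[44:], y[44:]

lam = 1.0  # arbitrary penalty for illustration (assumption)
beta, intercept = ridge_fit(X_cal, y_cal, lam)
y_hat = X_pred @ beta + intercept

mre = mean_relative_error(y_ref, y_hat)
r = correlation(y_ref, y_hat)
print(f"mean relative error: {mre:.4f}")
print(f"correlation coefficient: {r:.4f}")
```

In practice the penalty would be chosen from the data, for example by cross-validation or by inspecting how the coefficients change as the penalty grows; the fixed value here only keeps the sketch short.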

ZHANG Man, LIU Xu-hua, HE Xiong-kui, ZHANG Lu-da, ZHAO Long-lian, LI Jun-hui (张曼, 刘旭华, 何雄奎, 张录达, 赵龙莲, 李军会). Study on the Application of Ridge Regression to Near-Infrared Spectroscopy Quantitative Analysis and Optimum Wavelength Selection[J]. Spectroscopy and Spectral Analysis (光谱学与光谱分析), 2010, 30(5): 1214.
