光谱学与光谱分析, 2019, 39 (8): 2624, 网络出版: 2019-09-02  

A型恒星光谱线指数岭回归有效温度的预测分析

Line Index of A-Type Stellar Astronomical Spectrum Predict Effective Temperature by Ridge Regression Model
作者单位
1 齐齐哈尔大学计算机与控制工程学院, 黑龙江 齐齐哈尔 161006
2 梧州学院大数据与软件工程学院, 广西 梧州 543002
摘要
天文光谱线指数数据能够较好地保留着恒星的物理特征信息, 为此借助线指数特征数据构建多参数模型, 有利于更好地回归分析数据的共变关系及谱线的内在规律。 世界上光谱获取率最高的施密特天文望远镜LAMOST发布的观测光谱都已经过标记, 利用天文可视化工具分析这些标记的恒星光谱线指数会产生预测因子自相关, 多元线性回归时因变量存在共线性, 导致方差较大、得到最小二乘回归系数不稳定, 虽不影响使用回归的有效性, 但较难从回归方程中得到独立预测因子的评估系数。 利用LAMOST巡天光谱数据中A型恒星Lick线指数为数据源, 选取有效温度Teff为7 000~8 500 K, 取信噪比大于50的光谱特征值实现回归分析恒星参数Teff值, 经箱线图呈现DR5星表中, A型光谱86 097条具备Teff值大样本光谱数据的整体分布, 统计分析26种线指数的特征值后, 选取分布相似且带宽为12 的kp12, halpha12和hgamma12字段, 减少解释线指数变量的数目, 优化冗余变量方差膨胀因子(VIF)系数。 实验选取两两变量间观测数据集, 局部拟合回归散点、 同样的数据源使用散点图的总体轮廓生成高密度散点图, 利用色差透明性突出显示数据密集区域。 结果表明多元线性回归和岭回归算法都能从低分辨率光谱中确定A型恒星的有效温度, 但经过共线性数据分析有偏估计实验, 使用岭回归分析寻找最佳模型, 能更准确地确定恒星有效温度, 进而得到预测A型恒星有效温度及谱线回归特性。
Abstract
Line index is widely used in describing the features of spectral lines for astronomical objects because it retains the main physical characteristic information of these objects. Based on line index, a multi-parameter model for regression analysis could be used to uncover co-variation relationship of data and the inherent laws of spectral lines. The observed spectra released by LAMOST, which has the highest spectra acquisition capability, provide us with real data for establishing a robust regression model. The multivariate linear regression was applied to get the co-linearity of the dependent variables, however, it resulted in large variance. It is unstable to obtain the least squares regression coefficient sometimes. Especially, it’s difficult for the multivariate linear regression to obtain the evaluation coefficient of independent predictor from the regression equation. In this paper, we use the A-type stellar Lick line index in the LAMOST survey data as the data source. Selecting the spectra with effective temperature (Teff) from 7 000 to 8 500 K, and the signal-to-noise ratio higher than 50 to realize the regression analysis. After a set of linear biased estimation experiment for A-type stars, the method of ridge regression training was employed. In the catalogue of LAMOST data release 5 (DR5), 86 097 A-type spectra have provided the Teff value. After statistical analysis of the eigenvalues of 26 line indices, the kp12, halpha12 and hgamma12 with similar distribution and bandwidth of 12  were selected to reduce the data redundance. The number of variety was optimized for the redundant variable variance expansion factor (VIF) coefficient. Two regression experiments selected the same observation dataset to locally fit the regression scatter, using the overall contour of the scatter plot to generate a high-density scatter plot, highlighting the data-intensive region with the color difference transparency. The results show that both the multiple linear regression and the ridge regression algorithm can determine the effective temperature (Teff) of the A-type star through the low-resolution spectrum, but the co-linearity data analysis has some biased estimation. The ridge regression model can more accurately predict the effective temperature of A type stars from the low resolution spectra.

薛仁政, 陈淑鑫, 黄宏本. A型恒星光谱线指数岭回归有效温度的预测分析[J]. 光谱学与光谱分析, 2019, 39(8): 2624. XUE Ren-zheng, CHEN Shu-xin, HUANG Hong-ben. Line Index of A-Type Stellar Astronomical Spectrum Predict Effective Temperature by Ridge Regression Model[J]. Spectroscopy and Spectral Analysis, 2019, 39(8): 2624.

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!