光谱学与光谱分析, 2012, 32 (2): 510, 网络出版: 2012-02-20   

基于随机森林的激变变星候选体的数据挖掘

Data Mining Approach to Cataclysmic Variables Candidates Based on Random Forest Algorithm
姜斌 1,2,3,*罗阿理 1赵永恒 1
作者单位
1 中国科学院国家天文台, 北京 100012
2 山东大学威海分校机电与信息工程学院, 山东 威海 264209
3 中国科学院研究生院, 北京 100049
摘要
提出一种适用于在郭守敬望远镜海量光谱中自动、 快速筛选激变变星的方法。 利用已证认的激变变星光谱作为模板, 通过随机森林分类训练, 得到一个分类模型, 该模型给出了各个波长对应流量的重要性排序, 可根据该排序进行降维并用于激变变星判别, 结果作为反馈进一步丰富模板库。 实验中共发现了16个新的激变变星候选体, 表明了该方法的可行性。
Abstract
An automatic and efficient method for cataclysmic variables candidates is presented in the present paper. The identified CVs were selected as templates. A model was constructed by random forest algorithm with templates and random selected spectra. Wavelength ranking was described by the model and the classifier was constructed afterwards. Most of the non-candidates were excluded by the method. Template matching strategy was used to identify the final candidates which were analyzed to complement the templates as feedback. 16 new CVs candidates were found in the experiment that shows that our approach to finding special celestial bodies can be feasible in LAMOST.

姜斌, 罗阿理, 赵永恒. 基于随机森林的激变变星候选体的数据挖掘[J]. 光谱学与光谱分析, 2012, 32(2): 510. JIANG Bin, LUO A-li, ZHAO Yong-heng. Data Mining Approach to Cataclysmic Variables Candidates Based on Random Forest Algorithm[J]. Spectroscopy and Spectral Analysis, 2012, 32(2): 510.

本文已被 1 篇论文引用
被引统计数据来源于中国光学期刊网
引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!