光学 精密工程, 2020, 28 (3): 695, 网络出版: 2020-05-12
多标签分类的传统民族服饰纹样图像语义理解
Multi-label classification of traditional national costume pattern image semantic understanding
多标签分类 融合学习 传统民族服饰 语义理解 multi-label classification fusion learning traditional national costumes semantic understanding
摘要
针对当前图像多标签分类方法只关注图像本体类别信息(本体), 而忽略图像深层次语义信息(隐义)的问题, 本文提出了一种“本体-隐义”融合学习的图像多标签分类模型。该模型首先利用CNN中间层和较高层分别学习图像的本体信息和隐义信息, 然后利用本体信息与隐义信息之间的依赖关系设计了融合学习模型, 同时对提出模型的不同中间层特征和模型的不同结构进行了深入研究, 最终实现了对图像中多类别以及各类别蕴含的隐义信息分类。在传统民族服饰纹样图像数据集上进行实验, 得到图像本体多标签分类和隐义多标签分类的mAP分别为0.88和0.82; 在Scene数据集上进行对比实验, 本文模型在Hamming loss, One-error以及Average precision指标上分别优于其他最好方法0.103, 0091和0.083, 实验结果证明了本文方法的有效性和优越性。
Abstract
Since current image multi-label classification methods only focus on the category information of image ontology (ontology) and ignore the deep semantic information of the image (implicit), this study proposed an image multi-label classification model of “ontology-implicit” fusion learning. The model first used the middle and higher layers of CNN to learn the image ontology information and implicit information, respectively, and then it used the dependency relationship between the ontology information and implicit information to design the fusion learning model. Meanwhile, the different characteristics of the middle layer and different structures of the model were studied in-depth, to realize the classification of implicit information contained in multiple image categories. Experiments conducted on the traditional national costume pattern image datasets show that the mAP of image ontology multi-label classification and implicit multi-label classification are 0.88 and 0.82, respectively. Comparative experiments conducted on the Scene dataset show that the model is superior to other methods in Hamming loss, one error, and average precision indices, with values of 0.103, 0091, and 0.083, respectively. Therefore, the experimental results prove the effectiveness and superiority of this method.
赵海英, 周伟, 侯小刚, 齐光磊. 多标签分类的传统民族服饰纹样图像语义理解[J]. 光学 精密工程, 2020, 28(3): 695. ZHAO Hai-ying, ZHOU Wei, HOU Xiao-gang, QI Guang-lei. Multi-label classification of traditional national costume pattern image semantic understanding[J]. Optics and Precision Engineering, 2020, 28(3): 695.