激光与光电子学进展, 2021, 58 (4): 0410012, 网络出版: 2021-02-24  

结合场景描述的文本生成图像方法 下载: 1002次

Text Image Generation Method with Scene Description
作者单位
江西理工大学信息工程学院, 江西 赣州 341000
引用该论文

黄友文, 周斌, 唐欣. 结合场景描述的文本生成图像方法[J]. 激光与光电子学进展, 2021, 58(4): 0410012.

Youwen Huang, Bin Zhou, Xin Tang. Text Image Generation Method with Scene Description[J]. Laser & Optoelectronics Progress, 2021, 58(4): 0410012.

参考文献

[1] 岳师怡. 基于多层级上下文信息的图像语义分割[J]. 激光与光电子学进展, 2019, 56(24): 241005.

    Yue S Y. Image semantic segmentation based on hierarchical context information[J]. Laser & Optoelectronics Progress, 2019, 56(24): 241005.

[2] 李立凯, 卢炽华, 邹斌. 基于深度学习的目标检测与可行域分割研究[J]. 激光与光电子学进展, 2020, 57(12): 121013.

    Li L K, Lu C H, Zou B. Research on target detection and feasible region segmentation based on deep learning[J]. Laser & Optoelectronics Progress, 2020, 57(12): 121013.

[3] 刘万军, 王凤, 曲海成. 融合多尺度特征的目标检测模型[J]. 激光与光电子学进展, 2019, 56(23): 231007.

    Liu W J, Wang F, Qu H C. Object detection model based on multi-scale feature integration[J]. Laser & Optoelectronics Progress, 2019, 56(23): 231007.

[4] Goodfellow IJ, Pouget-AbadieJ, MirzaM, et al. Generative adversarial nets[C]//Proceedings of the 27th International Conference on Neural Information Prccessing Systems, December 7-12, 2014, Montreal, Quebec, Canada. New York: Curran Associates, 2014, 2: 2672- 2680.

[5] 陈晓范, 申海杰, 边倩, 等. 结合注意力机制的人脸超分辨率重建[J]. 西安电子科技大学学报(自然科学版), 2019, 46(3): 148-153.

    Chen X F, Shen H J, Bian Q, et al. Face image super-resolution with an attention mechanism[J]. Journal of Xidian University (Natural Science), 2019, 46(3): 148-153.

[6] ReedS, AkataZ, Yan XC, et al. ( 2016-06-05)[2020-05-28]. org/abs/1605. 05396. https://arxiv.

[7] MirzaM, Osindero S. Conditional generative adversarial nets[EB/OL]. ( 2014-11-06)[2020-05-28]. org/abs/1411. 1784. https://arxiv.

[8] ZhangH, XuT, Li HS, et al.StackGAN: text to photo-realistic image synthesis with stacked generative adversarial networks[C]//2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy. New York: IEEE Press, 2017: 5908- 5916.

[9] Zhang H, Xu T, Li H S, et al. StackGAN++: realistic image synthesis with stacked generative adversarial networks[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, 41(8): 1947-1962.

[10] XuT, Zhang PC, Huang QY, et al.AttnGAN: fine-grained text to image generation with attentional generative adversarial networks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA. New York: IEEE Press, 2018: 1316- 1324.

[11] JohnsonJ, GuptaA, Fei-FeiL. Image generation from scene graphs[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA.New York: IEEE Press, 2018: 1219- 1228.

[12] AshualO, WolfL. Specifying object attributes and relations in interactive scene generation[C]//2019 IEEE/CVF International Conference on Computer Vision (ICCV), October 27 - November 2, 2019, Seoul, Korea (South).New York: IEEE Press, 2019: 4560- 4568.

[13] 罗佳, 黄晋英. 生成式对抗网络研究综述[J]. 仪器仪表学报, 2019, 40(3): 74-84.

    Luo J, Huang J Y. Generative adversarial network: an overview[J]. Chinese Journal of Scientific Instrument, 2019, 40(3): 74-84.

[14] Chen QF, KoltunV. Photographic image synthesis with cascaded refinement networks[C]//2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy.New York: IEEE Press, 2017: 1520- 1529.

[15] Krishna R, Zhu Y K, Groth O, et al. Visual genome: connecting language and vision using crowdsourced dense image annotations[J]. International Journal of Computer Vision, 2017, 123(1): 32-73.

[16] Hu RH, DollárP, He KM, et al.Learning to segment every thing[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA.New York: IEEE Press, 2018: 4233- 4241.

[17] He KM, GkioxariG, DollárP, et al.Mask R-CNN[C]//2017 IEEE International Conference on Computer Vision (ICCV), October 22-29, 2017, Venice, Italy.New York: IEEE Press, 2017: 2980- 2988.

[18] Wang TC, Liu MY, Zhu JY, et al.High-resolution image synthesis and semantic manipulation with conditional GANs[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18-23, 2018, Salt Lake City, UT, USA.New York: IEEE Press, 2018: 8798- 8807.

[19] SalimansT, GoodfellowI, ZarembaW, et al. ( 2016-06-10)[2020-05-28]. org/abs/1606. 03498. https://arxiv.

[20] HeuselM, RamsauerH, UnterthinerT, et al. ( 2018-01-12)[2020-05-28]. org/abs/1706. 08500. https://arxiv.

黄友文, 周斌, 唐欣. 结合场景描述的文本生成图像方法[J]. 激光与光电子学进展, 2021, 58(4): 0410012. Youwen Huang, Bin Zhou, Xin Tang. Text Image Generation Method with Scene Description[J]. Laser & Optoelectronics Progress, 2021, 58(4): 0410012.

引用该论文: TXT   |   EndNote

相关论文

加载中...

关于本站 Cookie 的使用提示

中国光学期刊网使用基于 cookie 的技术来更好地为您提供各项服务,点击此处了解我们的隐私策略。 如您需继续使用本网站,请您授权我们使用本地 cookie 来保存部分信息。
全站搜索
您最值得信赖的光电行业旗舰网络服务平台!