Page 285 - 《软件学报》2025年第4期

P. 285

王永胜等: 多模态信息抽取研究综述 1691

Chapter of the Association for Computational Linguistics: Human Language Technologies. San Diego: ACL, 2016. 300–309. [doi: 10.
18653/v1/N16-1034]
[81] Bosselut A, Chen JF, Warren D, Hajishirzi H, Choi Y. Learning prototypical event structure from photo albums. In: Proc. of the 54th
Annual Meeting of the Association for Computational Linguistics (Vol. 1: Long Papers). Berlin: ACL, 2016. 1769–1779. [doi: 10.18653/
v1/P16-1167]
[82] Young P, Lai A, Hodosh M, Hockenmaier J. From image descriptions to visual denotations: New similarity metrics for semantic
inference over event descriptions. Trans. of the Association for Computational Linguistics, 2014, 2: 67–78. [doi: 10.1162/tacl_a_00166]
[83] Song ZY, Bies A, Strassel S, Riese T, Mott J, Ellis J, Wright J, Kulick S, Ryant N, Ma XY. From light to rich ERE: Annotation of
entities, relations, and events. In: Proc. of the the 3rd Workshop on EVENTS: Definition, Detection, Coreference, and Representation.
Denver: ACL, 2015. 89–98. [doi: 10.3115/v1/W15-0812]
[84] Li ML, Xu RC, Wang SH, Zhou LW, Lin XD, Zhu CG, Zeng M, Ji H, Chang SF. Clip-event: Connecting text and images with event
structures. In: Proc. of the 2022 IEEE/CVF Conf. on Computer Vision and Pattern Recognition. New Orleans: IEEE, 2022. 16399–16408.
[doi: 10.1109/CVPR52688.2022.01593]
[85] Moghimifar F, Shiri F, Nguyen V, Li YF, Haffari G. Theia: Weakly supervised multimodal event extraction from incomplete data. In:
Proc. of the 13th Int’l Joint Conf. on Natural Language Processing and the 3rd Conf. of the Asia-Pacific Chapter of the Association for
Computational Linguistics. Nusa Dua: ACL, 2023. 139–145. [doi: 10.18653/v1/2023.ijcnlp-short.16]
李培峰(1971－), 男, 博士, 教授, 博士生导师,
[86] Chen B, Lin XD, Thomas C, Li ML, Yoshida S, Chum L, Ji H, Chang SF. Joint multimedia event extraction from video and article. In:
Findings of the Association for Computational Linguistics: EMNLP 2021. Punta Cana: ACL, 2021. 74–88. [doi: 10.18653/v1/2021.
findings-emnlp.8]

附中文参考文献:
[1] 张亚洲, 戎璐, 宋大为, 张鹏. 多模态情感分析研究综述. 模式识别与人工智能, 2020, 33(5): 426–438. [doi: 10.16451/j.cnki.issn1003-
6059.202005005]
[2] 包希港, 周春来, 肖克晶, 覃飙. 视觉问答研究综述. 软件学报, 2021, 32(8): 2522–2544. http://www.jos.org.cn/1000-9825/6215.htm
[doi: 10.13328/j.cnki.jos.006215]
[5] 吴友政, 李浩然, 姚霆, 何晓冬. 多模态信息处理前沿综述: 应用、融合和预训练. 中文信息学报, 2022, 36(5): 1–20. [doi: 10.3969/
j.issn.1003-0077.2022.05.001]
[12] 张天明, 张杉, 刘曦, 曹斌, 范菁. 融合多模态数据的小样本命名实体识别方法. 软件学报, 2024, 35(3): 1107–1124. http://www.jos.org.
cn/1000-9825/7069.htm [doi: 10.13328/j.cnki.jos.007069]
[23] 张汝佳, 代璐, 王邦, 郭鹏. 基于深度学习的中文命名实体识别最新研究进展综述. 中文信息学报, 2022, 36(6): 20–35. [doi: 10.
3969/j.issn.1003-0077.2022.06.002]
[30] 黄世洲. 面向社交媒体的通用多模态信息抽取方法研究 [硕士学位论文]. 上海. 东华大学, 2022. [doi: 10.27012/d.cnki.gdhuu.
2022.002241]
王永胜(1990－), 男, 博士生, 主要研究领域为自王中卿(1987－), 男, 博士, 副教授, CCF 专业会
然语言处理, 信息抽取. 员, 主要研究领域为自然语言处理.

朱巧明(1963－), 男, 博士, 教授, 博士生导师,

CCF 高级会员, 主要研究领域为自然语言处理, CCF 杰出会员, 主要研究领域为中文信息处理,

机器学习. Web 信息处理.

280 281 282 283 284 285 286 287 288 289 290