Utilizing External Knowledge in Natural Language Video Localization 


Vol. 49,  No. 12, pp. 1097-1107, Dec.  2022
10.5626/JOK.2022.49.12.1097


PDF

  Abstract

State-of-the-art Natural Language Video Localization (NLVL) models mostly use existing labels to train. The use of either full-supervision or weak-supervision needs costly annotations, which are not applicable to the real-world NLVL problems. Thus, in this study, we propose the framework of External Knowledge-based Natural Language Video Localization (EK-NLVL), which leverages the idea of generating the pseudo-supervision based on a captioning model that generates sentences from the given frames and summarizes them to ground the video event. Moreover, we propose data augmentation using the pre-trained multi-modal representation learning model CLIP for visual-aligned sentence filtering to generate pseudo-sentences that could effectively provide better quality augmentation. We also propose a new model, Query-Attentive on Segmentations Network (QAS) which effectively uses external knowledge for the NLVL task. Experiments using the Charades-STA dataset demonstrated the efficacy of our method compared to the existing models.


  Statistics
Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.


  Cite this article

[IEEE Style]

D. Kim, D. Ahn, J. Choi, "Utilizing External Knowledge in Natural Language Video Localization," Journal of KIISE, JOK, vol. 49, no. 12, pp. 1097-1107, 2022. DOI: 10.5626/JOK.2022.49.12.1097.


[ACM Style]

Daneul Kim, Daechul Ahn, and Jonghyun Choi. 2022. Utilizing External Knowledge in Natural Language Video Localization. Journal of KIISE, JOK, 49, 12, (2022), 1097-1107. DOI: 10.5626/JOK.2022.49.12.1097.


[KCI Style]

Daneul Kim, Daechul Ahn, Jonghyun Choi, "Utilizing External Knowledge in Natural Language Video Localization," 한국정보과학회 논문지, 제49권, 제12호, 1097~1107쪽, 2022. DOI: 10.5626/JOK.2022.49.12.1097.


[Endnote/Zotero/Mendeley (RIS)]  Download


[BibTeX]  Download



Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr