Video Retrieval and Captioning

Video-level Retrieval

  • [2015 AAAI] Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework, [paper], [bibtex].
  • [2018 ECCV] Find and Focus: Retrieve and Localize Video Events with Natural Language Queries, [paper], [bibtex].
  • [2018 ECCV] Cross-Modal and Hierarchical Modeling of Video and Text, [paper], [bibtex], sources: [zbwglory/CMHSE].
  • [2019 CVPR] Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval, [paper], [bibtex], sources: [yalesong/pvse].
  • [2020 IEEE TMM] SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries, [paper], [bibtex].

Video Captioning