-
arXiv:2101.05954
Recent Advances in Video Question Answering: A Review of Datasets and Methods
Abstract: Video Question Answering (VQA) is a recent emerging challenging task in the field of Computer Vision. Several visual information retrieval techniques like Video Captioning/Description and Video-guided Machine Translation have preceded the task of VQA. VQA helps to retrieve temporal and spatial information from the video scenes and interpret it. In this survey, we review a number of methods and dat…
Submitted 18 March, 2021; v1 submitted 14 January, 2021; originally announced January 2021.
Comments: 18 pages, 5 tables, Video and Image Question Answering Workshop, 25th International Conference on Pattern Recognition
Journal ref: Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science, vol 12662. Springer
-
Comparative Study of Machine Learning Models and BERT on SQuAD
Abstract: This study aims to provide a comparative analysis of performance of certain models popular in machine learning and the BERT model on the Stanford Question Answering Dataset (SQuAD). The analysis shows that the BERT model, which was once state-of-the-art on SQuAD, gives higher accuracy in comparison to other models. However, BERT requires a greater execution time even when only 100 samples are used…
Submitted 22 May, 2020; originally announced May 2020.