Platform logo
Explore Communities
Orvium Community logo
Orvium CommunityCommunity hosting publication
You are watching the latest version of this publication, Version 1.
article

An Approach towards Video Captioning in Bengali

[version 1]

13/12/2022| By
M. M. Rushadul M. M. Rushadul Mannan,
+ 4
Md Adnanul Md Adnanul Islam
375 Views
0 Comments
Disciplines
Keywords
Abstract

Video captioning refers to the process of predicting a semantically consistent textual description from a given video clip. Even though a significant amount of research work is present for video captioning in English, for Bengali the field of video captioning is nearly unexplored. Therefore, this research aims at generating Bengali captions that plausibly describe the gist of a specific short video. To accomplish this, Long Short-Term Memory (LSTM) based a sequence-to-sequence model is used that takes the video frame features as input and generates an analogous textual description. In this study, Microsoft Research Video Description Corpus (MSVD) dataset is used which is an English dataset. Therefore, a deep learning-based translator and manual labor are used to convert English captions into appropriate Bengali ones. Finally, the model's performance is evaluated using popular evaluation metrics - BLEU and TER. The proposed approach achieves BLEU and TER scores of 0.38 and 0.76 respectively, establishing a new benchmark for the Bengali video captioning tasks.

Show Less
Submitted by13 Dec 2022
User Avatar
Md Adnanul Islam
Monash University
Download Publication

More details

  • License: CC0
  • Review type: Open Review
  • Publication type: Article
  • ISSN: 2094-0343
  • Volume: 71
  • Journal title: Mathematical Statistician and Engineering Applications

No reviews to show. Please remember to LOG IN as some reviews may be only visible to specific users.