
An Approach towards Video Captioning in Bengali

13/12/2022 | By M. M. Rushadul Mannan, + 4, Md Adnanul Islam
Abstract

Video captioning refers to the process of predicting a semantically consistent textual description from a given video clip. Even though a significant amount of research work exists for video captioning in English, the field is nearly unexplored for Bengali. Therefore, this research aims at generating Bengali captions that plausibly describe the gist of a specific short video. To accomplish this, a Long Short-Term Memory (LSTM) based sequence-to-sequence model is used that takes video frame features as input and generates an analogous textual description. In this study, the Microsoft Research Video Description Corpus (MSVD), an English dataset, is used. Therefore, a deep learning-based translator and manual labor are used to convert the English captions into appropriate Bengali ones. Finally, the model's performance is evaluated using the popular evaluation metrics BLEU and TER. The proposed approach achieves BLEU and TER scores of 0.38 and 0.76 respectively, establishing a new benchmark for the Bengali video captioning task.

Preview automatically generated from the publication file.

Mathematical Statistician and Engineering Applications, ISSN: 2094-0343, 2326-9865

An Approach towards Video Captioning in Bengali

M. M. Rushadul Mannan1, Mostafizur Rahman2, Md. Shahir Zaoad3, Md. Mahbubur Rahman4, Angshu Bikash Mandol5, Md. Adnanul Islam6

1-6Department of CSE, Military Institute of Science and Technology, Dhaka, Bangladesh

1rushadmannan@gmail.com, 2akrahman76@gmail.com, 3shahir.glhd@gmail.com, 4mahbub@cse.mist.ac.bd, 5angshubmandol@gmail.com, 6adnanul@cse.mist.ac.bd


I. INTRODUCTION

Video captioning incorporates Computer Vision (CV) and Natural Language Processing (NLP) techniques by extracting features from video frames across different modalities and then aggregating them spatially and temporally to produce a compact representation of what is happening over the video. With the rapid growth of video content creation on social platforms such as YouTube and Facebook, thousands of hours of video are uploaded every minute, which may need to be understood right away by local and global communities. The automatic generation of captions to describe scenes in images or videos can effectively confront this challenge. Video curation is another significant area where video captioning is much needed. Moreover, recent improvements in the accuracy and speed of video caption generation have led to real-life applications such as helping deaf and hard-of-hearing people understand video content.

Video captioning is one of the most challenging fields of NLP because of multiple temporal objects, scenes, action detection, and the identification of salient content. Despite such challenges, a few endeavors have been made [1, 2], mainly inspired by recent advancements with LSTM. LSTM is an artificial recurrent neural network (RNN) architecture. Whereas conventional RNNs struggle to preserve information over many time steps and suffer from the vanishing gradient problem, LSTM addresses these concerns by handling long-term dependencies and updating its hidden states through integrated memory units [3].

The above-mentioned works, and a vast amount of other work in the video captioning field, focus on English. There has also been work on Chinese, Hindi, and other languages [4, 5]. Although Bengali is one of the most spoken languages, with 268 million speakers [6], the amount of work done for the Bengali language is very limited. Several related studies have pursued image captioning in Bengali to extract information from still images and present it in textual form [7-8]. To the best of our knowledge, only one recent work on generating video descriptions in the Bengali language has been done. The underlying reasons may be the lack of Bengali datasets corresponding to the benchmark video captioning datasets for English, or the complexity of video captioning in Bengali itself. However, video captioning in Bengali can be highly helpful in various application domains, particularly for the large Bengali-speaking community worldwide.

II. RELATED WORK

Video streams with high spatio-temporal dependencies, as well as complex videos with multiple moving objects and actions, make captioning very difficult. Despite this, various studies have proposed methods that substantially encouraged research in video captioning and helped overcome these difficulties.

The early works in video captioning were rule-based, built using predefined frameworks. The basic idea was to identify and extract critical features from video frames and represent them using a predefined sequence of events. These sequences of events were then translated into text using templates [9-10]. Nevertheless, these early works only covered a limited domain of images and were later replaced by deep neural networks and the sequence-to-sequence model [11].

These shortcomings led to the use of RNN models that construct an encoder-decoder framework with a CNN-based architecture. Here, a sequence of video frames is fed into a Convolutional Neural Network to extract features, which are then passed into a deep recurrent network to translate the video into textual form [13].

Recently, different 2D and 3D CNN models have been introduced for feature extraction and have successfully improved state-of-the-art representation learning [14, 15]. Nevertheless, feature aggregation for video captioning remains an open challenge, and several techniques from different perspectives have been studied to explore it. To reduce the exponential error accumulation caused by short-term memory issues, researchers proposed a video captioning approach based on adversarial learning along with LSTM. An improvement of the same model, Bidirectional LSTM (Bi-LSTM) [16], was also introduced, which more profoundly captures the bidirectional global temporal structure of the video. For this purpose, a joint visual modelling approach was devised that combines a forward LSTM pass and a backward LSTM pass with visual features from CNNs to encode the video data.

The Transformer model [17] also opened new avenues to significantly enhance the performance of video captioning models. The Transformer is a simple network architecture based solely on the attention mechanism. It is an encoder-decoder neural network in which the encoder applies self-attention, while the decoder applies encoder-decoder attention in addition to self-attention. Different hybrid models combining the Transformer have been introduced and show sophisticated performance. A notable addition in the NLP field is the Bidirectional Encoder Representations from Transformers (BERT) model [18], a machine learning technique for NLP pre-training that uses semi-supervised learning and incorporates the Transformer mechanism. BERT is essentially the encoder stack of the Transformer architecture, designed to condition on both left and right context in all stack layers to pre-train deep bidirectional representations from unlabeled text.

Moreover, a few recent studies have proposed different deep learning-based models for Bengali description generation. A study conducted by Khan et al. [8] presented an end-to-end image captioning system using a multimodal architecture. A pre-trained ResNet-50 image encoder combined with a one-dimensional CNN is used to encode a sequence of information for extracting region-based visual features. This study was performed on the BanglaLekhaImageCaptions dataset consisting of 9,154 images. In another recent study [19], a Transformer-based encoder-decoder network combined with a pre-trained ResNet-101 CNN model for feature extraction was outlined for Bangla image caption generation, also on the BanglaLekhaImageCaptions dataset. This approach outperformed all existing Bengali image captioning works and set a new benchmark in terms of BLEU and METEOR.


Fig. 1. Video Captioning using encoder-decoder LSTM

In the video captioning field, the Bengali language has seen very little work. The only work on Bengali video captioning was done by Raj et al. [20]. Their proposed architecture is based on an encoder-decoder mechanism that combines various 2D and 3D CNNs with a Bi-LSTM as the encoder, while the decoder comprises a two-layer LSTM. This paper opened a new direction for the Bengali video captioning field. They trained and evaluated the model on the MSVD dataset converted with the help of the Google Translate API. Compared to closely related Bengali image captioning works, they achieved the existing state-of-the-art result with 32.6 percent on BLEU and 51.2 percent on CIDEr.

III. PROPOSED METHODOLOGY

This research aims to design a model that generates Bengali captions from short videos, where the video is taken as input and a textual description is produced as output. In this scenario, both input and output are sequences, as the video contains a sequence of frames while the caption is a sequence of words. For this purpose, LSTM, a many-to-many sequence model, is used. Unlike plain RNNs, this network provides internal gates that avoid the vanishing gradient problem and allow the model to implement backpropagation through time successfully. It comprises two portions: an encoder LSTM and a decoder LSTM. The encoder portion takes video frame features as input, and based on the encoder's output, the decoder generates textual descriptions (see “Fig. 1”).

A. Word Embedding

Word embedding represents text in such a way that words with the same meaning have a similar representation. This is required to bridge the human understanding of a text to a computer or machine. Therefore, each word is represented in a dictionary as a vector of integers. We used the tokenizer class of Keras to create a vocabulary of size 1500. Individual words in this vocabulary are represented as real-valued vectors in this 1500-word vector space.
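The paper does not include its preprocessing code, so the following is only a minimal sketch, assuming the Keras Tokenizer class mentioned above, of how a 1500-word vocabulary could be built; the sample captions and the <bos>/<eos> markers (used later in the Decoder subsection) are placeholders.

```python
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

VOCAB_SIZE = 1500   # vocabulary size used in this study
MAX_LEN = 10        # maximum caption length (see the Decoder subsection)

# Hypothetical Bengali captions wrapped with <bos>/<eos> markers
captions = ["<bos> একজন লোক গিটার বাজাচ্ছে <eos>",
            "<bos> একটি বিড়াল খেলছে <eos>"]

# Custom filter list keeps '<' and '>' so the <bos>/<eos> markers survive tokenisation
tokenizer = Tokenizer(num_words=VOCAB_SIZE,
                      filters='!"#$%&()*+,-./:;=?@[\\]^_`{|}~')
tokenizer.fit_on_texts(captions)

# Each caption becomes a sequence of integer word indices, padded to MAX_LEN
sequences = pad_sequences(tokenizer.texts_to_sequences(captions),
                          maxlen=MAX_LEN, padding='post')
print(sequences.shape)  # (number of captions, 10)
```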

B. Feature Extraction

To extract features, each video is split into 80 frames irrespective of its length, which are then passed to a pre-trained CNN model to extract the feature vector. VGG16, a deep convolutional network for large-scale image recognition, was adopted for this research. This model achieves 92.7 percent top-5 test accuracy on ImageNet, a dataset comprising 14 million images. Before being passed to the CNN model, frames are scaled to 224x224. Almost all the videos use RGB color; therefore, a tensor of size (224x224x3) is used as input to the CNN model. After processing each frame, the model provides an output vector of 4096 values. As a result, a NumPy array of size 80x4096 is obtained for each video, containing the desired extracted features.
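As the paper's extraction code is not provided, the snippet below is a sketch of this step: it samples 80 frames with OpenCV, resizes them to 224x224, and takes the 4096-dimensional output of VGG16's fc2 layer. Uniform frame sampling and the helper name extract_video_features are assumptions.

```python
import cv2
import numpy as np
from tensorflow.keras.applications.vgg16 import VGG16, preprocess_input
from tensorflow.keras.models import Model

# VGG16 truncated at the fc2 layer, which outputs a 4096-dimensional vector per frame
base = VGG16(weights='imagenet')
feature_extractor = Model(inputs=base.input, outputs=base.get_layer('fc2').output)

def extract_video_features(video_path, num_frames=80):
    """Sample num_frames frames uniformly and return an (80, 4096) feature array."""
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    indices = np.linspace(0, max(total - 1, 0), num_frames).astype(int)
    frames = []
    for idx in indices:
        cap.set(cv2.CAP_PROP_POS_FRAMES, int(idx))
        ok, frame = cap.read()
        if not ok:
            frame = np.zeros((224, 224, 3), dtype=np.uint8)  # pad if a read fails
        frame = cv2.resize(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB), (224, 224))
        frames.append(frame)
    cap.release()
    batch = preprocess_input(np.array(frames, dtype=np.float32))  # (80, 224, 224, 3)
    return feature_extractor.predict(batch, verbose=0)            # (80, 4096)
```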

C. Encoder

The encoder LSTM takes the video frame features as input. Each frame is passed to one encoder cell, which maintains an internal cell state 'c' and provides a hidden output state 'h'. As each video is divided into 80 frames, the encoder architecture has 80 cells (the time steps of the encoder), each of which takes an input vector of size 4096. In this research, we used an encoder with 512 hidden units. As a result, each encoder cell maps the 4096 video frame features to a vector of size 512 at each step. As LSTM is a many-to-many sequence model, we are only concerned with the last state of the encoder, discarding all other outputs. Finally, the last states (state-c and state-h) are sent to the decoder as its initial state.
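A minimal Keras sketch of such an encoder follows; the paper gives only the dimensions, so the layer names are illustrative.

```python
from tensorflow.keras.layers import Input, LSTM

TIME_STEPS = 80     # one encoder time step per sampled frame
FEATURE_DIM = 4096  # VGG16 fc2 feature size
LATENT_DIM = 512    # hidden units of the encoder

encoder_inputs = Input(shape=(TIME_STEPS, FEATURE_DIM), name='encoder_inputs')
# return_state=True exposes the final hidden state (state_h) and cell state (state_c);
# the per-step outputs are discarded, since only the last states go to the decoder.
encoder_outputs, state_h, state_c = LSTM(LATENT_DIM, return_state=True,
                                         name='encoder_lstm')(encoder_inputs)
encoder_states = [state_h, state_c]
```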

D. Decoder

The decoder LSTM outputs the textual description of the video. Using the word embeddings and the last states of the encoder, the decoder predicts the words. Each word embedding is passed to a decoder cell as input. This work uses ten words as the maximum length for a sentence; thus a total of ten decoder cells are used, making the decoder's time steps ten. The first cell of the decoder takes the <bos> (beginning of sentence) token as input along with the last state of the encoder and predicts the first word. The second cell takes the first-word embedding from the input caption and predicts the second word, the third cell takes the second-word embedding from the caption and predicts the third word, and so on. The cycle continues until the <eos> (end of sentence) token is reached.
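Continuing the encoder sketch, the following shows one plausible way to wire the decoder in Keras under teacher forcing. The embedding size (300) and the exact layer arrangement are assumptions; the paper only states the time steps, the hidden size, and the vocabulary size.

```python
from tensorflow.keras.layers import Input, LSTM, Embedding, Dense
from tensorflow.keras.models import Model

TIME_STEPS, FEATURE_DIM, LATENT_DIM = 80, 4096, 512
MAX_CAPTION_LEN, VOCAB_SIZE, EMBED_DIM = 10, 1500, 300  # EMBED_DIM is assumed

# Encoder (recap of the previous sketch)
encoder_inputs = Input(shape=(TIME_STEPS, FEATURE_DIM))
_, state_h, state_c = LSTM(LATENT_DIM, return_state=True)(encoder_inputs)

# Decoder: initialised with the encoder's final states, it reads the shifted
# caption (teacher forcing) and predicts a word distribution at every time step.
decoder_inputs = Input(shape=(MAX_CAPTION_LEN,))
decoder_embedding = Embedding(VOCAB_SIZE, EMBED_DIM, mask_zero=True)(decoder_inputs)
decoder_outputs, _, _ = LSTM(LATENT_DIM, return_sequences=True,
                             return_state=True)(decoder_embedding,
                                                initial_state=[state_h, state_c])
decoder_outputs = Dense(VOCAB_SIZE, activation='softmax')(decoder_outputs)

model = Model([encoder_inputs, decoder_inputs], decoder_outputs)
model.summary()
```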

E. Optimization

Neural networks have weights and biases that determine the performance of the model. However, these weights and biases cannot be calculated analytically; instead, they are found using an iterative optimization procedure. Adam optimization, an extension of stochastic gradient descent, is used in this research. Unlike plain stochastic gradient descent, it uses adaptive estimates of the first-order and second-order moments of the gradients.
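A hedged sketch of how the model above could be compiled with Adam in Keras; the learning rate of 0.0003 anticipates the best value reported in the Hyperparameter Selection subsection, and the loss function is our assumption rather than something stated in the paper.

```python
from tensorflow.keras.optimizers import Adam

# `model` is the encoder-decoder model sketched in the Decoder subsection.
# Assumed loss: sparse categorical cross-entropy pairs the softmax word
# distribution with integer word indices as targets.
model.compile(optimizer=Adam(learning_rate=0.0003),
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
```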

IV. EXPERIMENTAL SETUP

The experiment for this research is conducted in Google Colaboratory, a Python development environment. Colab provides various pre-installed Python and deep learning libraries. Some of the notable libraries used in this research are NumPy, TensorFlow, and Keras. NumPy is a Python library that supports large multidimensional arrays and matrices. TensorFlow is an open-source library for various machine learning applications, whereas Keras provides an interface for artificial neural networks accessible from Python.

A. Training

The purpose of training is to make a model learn a mapping function. In our case, the mapping is between a video and a corresponding textual description. To establish that, we used an encoder-decoder LSTM. Both the encoder and decoder have 512 hidden units. The batch size is set to 320, i.e., 320 videos from the dataset are processed in each batch. For training, another critical hyperparameter is the number of epochs. Too many epochs can cause overfitting of the model, while too few can lead to underfitting.

To avoid this problem, we used a callback approach known as early stopping. This enables defining an arbitrarily large number of epochs (100) and automatically stops the training when the performance stops improving. In this research, early stopping uses a patience of five epochs, i.e., it waits five epochs without improvement before stopping the training. In addition, another callback is used for the situation where performance does not change over a given number of training epochs, known as a plateau. Here, ReduceLROnPlateau is used, which adjusts the learning rate whenever a plateau is detected over two epochs. Training with all these configurations, we obtained an accuracy of 0.7842 (see “Fig. 2”).
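A sketch of how these callbacks could be configured in Keras is shown below; the monitored quantity, restore_best_weights, and the data array names are assumptions, and the 15% validation split mirrors the 85% training/validation ratio described in the Dataset subsection.

```python
from tensorflow.keras.callbacks import EarlyStopping, ReduceLROnPlateau

# Stop training once the validation loss has not improved for 5 consecutive epochs
early_stop = EarlyStopping(monitor='val_loss', patience=5, restore_best_weights=True)

# Shrink the learning rate by a factor of 0.1 after a 2-epoch plateau
reduce_lr = ReduceLROnPlateau(monitor='val_loss', factor=0.1, patience=2)

# `model` is the compiled encoder-decoder model; the data arrays are placeholders:
# video_features: (N, 80, 4096), caption_inputs: (N, 10), caption_targets: (N, 10)
history = model.fit([video_features, caption_inputs], caption_targets,
                    validation_split=0.15,
                    batch_size=320, epochs=100,
                    callbacks=[early_stop, reduce_lr])
```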

Fig. 2. Training and validation accuracy and loss

B. Hyperparameter Selection

Hyperparameters are configurations of the model that are assigned externally. Some significant hyperparameters are the number of epochs, the learning rate, and the batch size. These parameters are crucial to the model's success. Firstly, to avoid overfitting and underfitting, we used early stopping with an arbitrarily large number of epochs. The model in this research was trained with various patience values (5, 10, and 15) for two epoch budgets of 50 and 100, where patience determines the number of epochs checked before early stopping is triggered. Another important hyperparameter is the learning rate: a large learning rate can drive the model to converge quickly to a suboptimal solution, whereas a very small one can bring the process to a standstill. We trained our model with two different learning rates, 0.0003 and 0.00003, to find the optimal one. Furthermore, we trained with factors of both 0.1 and 0.01 for ReduceLROnPlateau, a callback used to reduce the learning rate by the specified factor when the model stops improving. A summary of the combinations of hyperparameters and their respective training accuracies is presented in Table 1.

Figure 3 is the graphical representation of Table 1. The figure provides a clear picture of the difference in accuracy based on the mentioned hyperparameters, where the best accuracy of 0.7842 is obtained for 100 epochs, a learning rate of 0.0003, and a ReduceLROnPlateau factor of 0.1.
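As a schematic sketch of the sweep behind Table 1, the loop below enumerates the combinations described above; build_and_train is a hypothetical helper that would compile and train the model as in the previous sketches, so it is left as a stub.

```python
import itertools

def build_and_train(lr, factor, epochs, patience):
    """Hypothetical helper: compile the model with Adam(lr), train it with
    EarlyStopping(patience) and ReduceLROnPlateau(factor) for `epochs` epochs
    (see the Training sketch), and return the best training accuracy."""
    raise NotImplementedError  # placeholder, to be wired to the sketches above

# Sweep mirroring Table 1: two learning rates, two ReduceLROnPlateau factors,
# two epoch budgets, and three early-stopping patience values.
grid = itertools.product([0.0003, 0.00003], [0.1, 0.01], [50, 100], [5, 10, 15])

results = {}
for lr, factor, epochs, patience in grid:
    results[(lr, factor, epochs, patience)] = build_and_train(lr, factor, epochs, patience)

best = max(results, key=results.get)
print('best configuration:', best, 'accuracy:', results[best])
```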

C. Dataset

To train the proposed video captioning model and evaluate it accordingly, the widely used MSVD dataset is used. Video snippets from this dataset are divided into a training set and a testing set consisting of 1450 and 100 videos, respectively, with an 85% split ratio for training and validation purposes. The first outcome of our research is a semantically consistent Bengali video captioning dataset consisting of more than 32 thousand captions for 1450 video snippets, adapted from the original MSVD dataset. To achieve this goal, we converted the dataset into Bengali using a deep learning-based translator [21] and manual labor. Some participants evaluated the machine-translated captions to compare their consistency against human perception, ensuring the dataset's relevance to Bengali language and culture as closely as possible.
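One possible way to produce the machine-translated drafts is sketched below. The paper cites a tutorial [21] rather than a specific library, so the choice of the deep_translator package and its GoogleTranslator class is our assumption; the drafts would then be reviewed manually as described above.

```python
from deep_translator import GoogleTranslator  # assumed library choice, see note above

translator = GoogleTranslator(source='en', target='bn')  # English -> Bengali

def translate_captions(english_captions):
    """Machine-translate MSVD captions; failures are flagged for manual translation."""
    bengali_captions = []
    for caption in english_captions:
        try:
            bengali_captions.append(translator.translate(caption))
        except Exception:
            bengali_captions.append(None)  # to be handled by a human annotator
    return bengali_captions

print(translate_captions(["a man is playing a guitar"]))
```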

Fig. 3. Comparison of accuracy between various combinations of significant hyperparameters

V. RESULT AND EVALUATION

This section summarizes the results of testing and evaluating our proposed video captioning model.

A. Quantitative Analysis

When evaluating video captioning models, some consistent evaluation protocols are used. Comparing results from different approaches can be very enlightening for assessing the performance of the proposed model. Among the numerous evaluation metrics with distinct evaluation methods, we use BLEU and TER.

We evaluated the candidate captions generated by our model against the given set of reference captions for the video clips to check their quality against fixed standards. In this research, BLEU is used as one of the evaluation metrics. The study in [22] found that BLEU performs well for corpus-level comparisons over which many n-gram matches exist. However, n-gram matches rarely occur at the sentence level, specifically when the value of n is higher (e.g., n=4). We used BLEU-1 (unigram), BLEU-2 (bigram), BLEU-3 (trigram), and BLEU-4 (4-gram) to evaluate the proposed model and compare performance across sentence lengths. Among these, BLEU-4 is considered the most informative BLEU metric for the qualitative analysis part of this study.
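A minimal sketch of computing BLEU-1 through BLEU-4 with NLTK follows; the tokenised Bengali captions are hypothetical, and the smoothing function is our assumption for handling sentences with few higher-order n-gram matches.

```python
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

# Hypothetical tokenised example: two Bengali reference captions vs. one candidate
references = [["একজন", "লোক", "গিটার", "বাজাচ্ছে"],
              ["একজন", "মানুষ", "গিটার", "বাজাচ্ছে"]]
candidate = ["একজন", "লোক", "গিটার", "বাজাচ্ছে"]

smooth = SmoothingFunction().method1  # avoids zero scores when higher n-grams are absent
weights = {"BLEU-1": (1, 0, 0, 0),
           "BLEU-2": (0.5, 0.5, 0, 0),
           "BLEU-3": (1/3, 1/3, 1/3, 0),
           "BLEU-4": (0.25, 0.25, 0.25, 0.25)}

for name, w in weights.items():
    score = sentence_bleu(references, candidate, weights=w, smoothing_function=smooth)
    print(name, round(score, 4))
```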

To assess the proposed model from a different outlook, Translation Error Rate (TER) [23] is used, which evaluates performance by measuring the amount of editing a human would need to perform to make the system output exactly match the reference translation. In terms of correlation with human perceptions of MT quality, the single-reference variant of TER performs comparably to the four-reference variant of BLEU. Compared to BLEU, TER aims for higher correlation with human assessments than n-gram-based methods by assigning lower costs to phrasal shifts. Table 2 presents the results of this study on the adapted MSVD dataset using the evaluation metrics mentioned above.
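As a sketch of the TER computation, the snippet below uses the TER metric from the sacrebleu package against a single reference caption; the library choice and the example strings are assumptions, since the paper does not state how TER was computed.

```python
from sacrebleu.metrics import TER  # assumed library choice for computing TER

ter_metric = TER()

# Hypothetical example: one candidate caption against a single reference caption
hypothesis = "একজন লোক গিটার বাজাচ্ছে"
reference = "একজন মানুষ গিটার বাজাচ্ছে"

result = ter_metric.sentence_score(hypothesis, [reference])
print(result.score / 100.0)  # sacrebleu reports TER on a 0-100 scale
```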

TABLE II. PERFORMANCE SCORES OF PROPOSED MODEL USING DIFFERENT SEARCH TECHNIQUES


Fig. 4. Sample captions generated by our approach along with corresponding reference captions

B. Qualitative Analysis

The sample outputs of our model, in terms of the quality of the generated captions, can be visualized for five videos tested in real time.

This evaluation process is pursued with 30 participants, each providing a single reference caption for each video. A subset of the most appropriate reference captions is selected for evaluation purposes. Example reference captions for each video are shown in Figure 4. A comprehensive qualitative analysis of the predicted captions against the reference captions in terms of the evaluation metrics for the tested videos is presented in “Fig. 5”.

Fig. 5-1. Using Greedy Search

Fig. 5-2. Using Beam search

Fig. 5. Qualitative analysis of the performance achieved by the proposed approach in terms of BLEU and TER


In “Fig. 5-1”, using greedy search, the BLEU-4 score is lower while the TER score is higher, reflecting the poorer performance of greedy search. “Fig. 5-2” illustrates the evaluation of the beam search approach using BLEU-4 and TER, where the BLEU score is higher than TER, which indicates the effectiveness of beam search.

For generating the caption in Bengali, the most likely output sequence has to be decoded by searching over possible output sequences. Two different search algorithms were used for this purpose: beam search and greedy search. Greedy search uses local optimality, selecting the single best word at each stage to generate the caption. In contrast, the beam search algorithm, a heuristic search method, keeps k possible alternatives at each position while decoding an input sequence. “Fig. 6” presents a comparative analysis of the above-mentioned tested videos using these search methods.
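The two decoding strategies can be contrasted with the schematic sketch below; predict_next is a hypothetical stand-in for one step of the trained decoder, returning a word-to-probability mapping, and the beam width k and caption length follow the values used in this paper.

```python
import numpy as np

BOS, EOS, MAX_LEN = '<bos>', '<eos>', 10

def greedy_decode(predict_next):
    """Pick the single most probable word at every step (locally optimal)."""
    sequence = [BOS]
    for _ in range(MAX_LEN):
        probs = predict_next(sequence)      # dict: word -> probability
        word = max(probs, key=probs.get)
        sequence.append(word)
        if word == EOS:
            break
    return sequence

def beam_search_decode(predict_next, k=3):
    """Keep the k most probable partial captions (by log-probability) at every step."""
    beams = [([BOS], 0.0)]
    for _ in range(MAX_LEN):
        candidates = []
        for seq, score in beams:
            if seq[-1] == EOS:              # finished captions are carried over unchanged
                candidates.append((seq, score))
                continue
            probs = predict_next(seq)
            for word, p in probs.items():
                candidates.append((seq + [word], score + np.log(p + 1e-12)))
        beams = sorted(candidates, key=lambda b: b[1], reverse=True)[:k]
    return beams[0][0]
```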


Fig. 6. Comparative analysis between greedy and beam search techniques for captioning the five test videos in terms of different evaluation metrics - (1) BLEU-1, (2) BLEU-2, (3) BLEU-3, (4) BLEU-4, and (5) TER.

The figure reflects that beam search (blue curve) performs better than greedy search (orange curve), as the blue curve overtakes the orange curve for the BLEU scores (n=1 to 4), and the reverse happens for the TER score. We find that the TER error rate for our model is slightly higher than expected. This may be attributed to the fact that we used several reference captions for the BLEU measures, while TER is evaluated using only one reference caption.

Finally, Table 3 presents a comparison of our proposed approach with the only other existing model [20] for video captioning in Bengali. It illustrates the proposed model's higher performance for both the BLEU-3 and BLEU-4 scores. In our research, manual labor was incorporated along with a deep-learning-based translator to translate the English captions of the MSVD dataset into Bengali, which led to the better performance of the proposed model.

TABLE III. PERFORMANCE COMPARISON OF THE PROPOSED MODEL WITH THE EXISTING VIDEO CAPTIONING MODEL

VI. CONCLUSION AND FUTURE WORK

In this paper, we have developed a dataset for Bengali video captioning from the MSVD dataset. We employed a pre-trained CNN model, specifically VGG16, for feature extraction and then used our prepared dataset to train the LSTM model. The soundness of LSTM layers for sequential data made it possible for the generated captions to be more natural. Finally, a comparative analysis of the model's performance is performed for two different search techniques in terms of two popular evaluation metrics, BLEU and TER, with scores of 0.38 and 0.76, respectively.

The proposed system is developed using a CNN and LSTM architecture. In the future, the system's performance can be further improved by exploring other RNN-based architectures or the BERT model [24-26]. A comprehensive comparative analysis is yet to be performed among different machine learning models for video captioning in Bengali [26]. Moreover, to enhance the use of automatic video captioning in real life, we plan to convert the generated captions to audio using NLP techniques so that they can serve blind people for various purposes (e.g., education, navigation guidance), which we leave as potential future work.

REFERENCES

[1] L. Gao, Z. Guo, H. Zhang, X. Xu, and H. T. Shen, "Video Captioning With Attention-Based LSTM and Semantic Consistency," IEEE Transactions on Multimedia, vol. 19, no. 9, pp. 2045–2055, Sep. 2017, doi: 10.1109/tmm.2017.2729019.

[2] X. Li, Z. Zhou, L. Chen, and L. Gao, "Residual attention-based LSTM for video captioning," World Wide Web, vol. 22, no. 2, pp. 621–636, Feb. 2018, doi: 10.1007/s11280-018-0531-z.

[3] S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, no. 8, pp. 1735–1780, Nov. 1997, doi: 10.1162/neco.1997.9.8.1735.

[4] A. Singh, T. D. Singh, and S. Bandyopadhyay, "Attention based video captioning framework for Hindi," Multimedia Systems, vol. 28, no. 1, pp. 195–207, Jun. 2021, doi: 10.1007/s00530-021-00816-3.

[5] K. Lin, Z. Gan, and L. Wang, "Multi-modal Feature Fusion with Feature Attention for VATEX Captioning Challenge 2020," arXiv:2006.03315 [cs, eess], Jun. 2020. Accessed: Mar. 15, 2022. [Online]. Available: https://arxiv.org/abs/2006.03315

[6] M. Szmigiera, "Most spoken languages in the world," Statista, Mar. 30, 2021. https://www.statista.com/statistics/266808/the-most-spoken-languages-worldwide/

[7] Md. A. Jishan, K. R. Mahmud, A. K. A. Azad, M. R. A. Rashid, B. Paul, and Md. S. Alam, "Bangla language textual image description by hybrid neural network model," Indonesian Journal of Electrical Engineering and Computer Science, vol. 21, no. 2, p. 757, Feb. 2021, doi: 10.11591/ijeecs.v21.i2.pp757-767.

[8] M. Faiyaz Khan, S. M. Sadiq-Ur-Rahman, and M. Saiful Islam, "Improved Bengali image captioning via deep convolutional neural network based encoder-decoder model," in Algorithms for Intelligent Systems, Singapore: Springer Singapore, 2021, pp. 217–229.

[9] A. Kojima, T. Tamura, and K. Fukunaga, "Natural Language Description of Human Activities from Video Images Based on Concept Hierarchy of Actions," International Journal of Computer Vision, vol. 50, no. 2, pp. 171–184, 2002, doi: 10.1023/a:1020346032608.

[10] P. Das, C. Xu, R. F. Doell, and J. J. Corso, "A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching," in 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013.

[11] I. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to sequence learning with neural networks," in Advances in Neural Information Processing Systems, 2014, vol. 27. [Online]. Available: https://proceedings.neurips.cc/paper/2014/file/a14ac55a4f27472c5d894ec1c3c743d2-Paper.pdf

[12] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the inception architecture for computer vision," in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

[13] S. Venugopalan, H. Xu, J. Donahue, M. Rohrbach, R. Mooney, and K. Saenko, "Translating videos to natural language using deep recurrent neural networks," in Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2015.

[14] K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv [cs.CV], 2014.

[15] S. Xie, R. Girshick, P. Dollar, Z. Tu, and K. He, "Aggregated residual transformations for deep neural networks," in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

[16] M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997, doi: 10.1109/78.650093.

[17] A. Vaswani et al., "Attention is all you need," Advances in Neural Information Processing Systems, vol. 30, pp. 5998–6008, 2017. Accessed: Nov. 25, 2020. [Online]. Available: https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html

[18] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of deep bidirectional Transformers for language understanding," arXiv [cs.CL], 2018.

[19] M. A. H. Palash, M. D. A. A. Nasim, S. Saha, F. Afrin, R. Mallik, and S. Samiappan, "Bangla image caption generation through CNN-transformer based encoder-decoder network," arXiv [cs.CV], 2021.

[20] A. H. Raj, A. Seum, A. Dash, S. Islam, and F. M. Shah, "Deep learning based video captioning in Bengali," in 2021 26th International Conference on Automation and Computing (ICAC), 2021.

[21] Nidhaloff, "How to translate text with python," Analytics Vidhya, Oct. 17, 2020. https://medium.com/analytics-vidhya/how-to-translate-text-with-python-9d203139dcf5 (accessed Mar. 16, 2022).

[22] K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, "BLEU: A method for automatic evaluation of machine translation," in Proceedings of the 40th Annual Meeting on Association for Computational Linguistics - ACL '02, 2001.

[23] M. Snover, B. Dorr, R. Schwartz, L. Micciulla, and J. Makhoul, "A study of translation edit rate with targeted human annotation," in Proceedings of the 7th Conference of the Association for Machine Translation in the Americas: Technical Papers, 2006, pp. 223–231.

[24] M. A. Islam, M. S. H. Anik, and A. B. M. A. A. Islam, "Towards achieving a delicate blending between rule-based translator and neural machine translator," Neural Computing and Applications, vol. 33, no. 18, pp. 12141–12167, 2021. https://doi.org/10.1007/s00521-021-05895-x

[25] P. Ajitha, A. Sivasangari, R. M. Gomathi, and K. Indira, "Prediction of customer plan using churn analysis for telecom industry," Recent Advances in Computer Science and Communications, vol. 13, no. 5, pp. 926–929, 2020.

[26] A. Sivasangari, P. Ajitha, Rajkumar, and Poonguzhali, "Emotion recognition system for autism disordered people," Journal of Ambient Intelligence and Humanized Computing, 2019.

[27] P. Ajitha, J. Lavanya Chowdary, K. Joshika, A. Sivasangari, and R. M. Gomathi, "Third vision for women using deep learning techniques," in 4th International Conference on Computer, Communication and Signal Processing (ICCCSP 2020), 2020, 9315196.

[28] A. Sivasangari, R. M. Gomathi, P. Ajitha, and Anandhi, "Data fusion in smart transport using convolutional neural network," Journal of Green Engineering, vol. 10, no. 10, pp. 8512–8523, 2020.

[29] A. Sivasangari, P. Ajitha, and R. M. Gomathi, "Light weight security scheme in wireless body area sensor network using logistic chaotic scheme," International Journal of Networking and Virtual Organisations, vol. 22, no. 4, pp. 433–444, 2020.

[30] A. Sivasangari, S. Bhowal, and R. Subhashini, "Secure encryption in wireless body sensor networks," Advances in Intelligent Systems and Computing, vol. 814, pp. 679–686, 2019.

[31] K. Sindhu, R. Subhashini, S. Gowri, and J. S. Vimali, "A women safety portable hidden camera detector and jammer," in Proceedings of the 3rd International Conference on Communication and Electronics Systems (ICCES 2018), 2018, pp. 1187–1189, 8724066.

[32] S. Gowri and J. Jabez, "Novel methodology of data management in ad hoc network formulated using nanosensors for detection of industrial pollutants," in International Conference on Computational Intelligence, Communications, and Business Analytics, Springer, Singapore, 2017, pp. 206–216.

[33] S. Gowri and G. Divya, "Automation of garden tools monitored using mobile application," in International Conference on Innovation Information in Computing Technologies, IEEE, Feb. 2015, pp. 1–6.

[34] M. S. H. Mukta, M. A. Islam, F. Khan, A. Hossain, S. Razik, S. Hossain, and J. Mahmud, "A comprehensive guideline for Bengali sentiment annotation," ACM Trans. Asian Low-Resour. Lang. Inf. Process., vol. 21, no. 2, Article 30, pp. 1–19, 2022, doi: https://doi.org/10.1145/3474363

[35] S. Sakiba, M. M. U. Shuvo, N. Hossain, S. K. Das, J. D. Mela, and M. A. Islam, "A memory-efficient tool for Bengali parts of speech tagging," in Artificial Intelligence Techniques for Advanced Computing Applications, Lecture Notes in Networks and Systems, vol. 130, Springer, Singapore, 2021. https://doi.org/10.1007/978-981-15-5329-5_8

Submitted by Md Adnanul Islam (Monash University) on 13 Dec 2022