Implementing Video OCR along with SWT Technique for Video indexing and Analysis

Paruchuru Grishman 1,*, Akula Rajitha 1, Mohammed Khaja Moinuddin 1, Mannava Subhramanaya Sreekar 1, Siddam Jayanth 1

1. IARE/IT/Hyderabad, Telangana, 500043

* Corresponding author.


Received: 4 Jun. 2022 / Revised: 29 Jul. 2022 / Accepted: 25 Aug. 2022 / Published: 8 Feb. 2023

Index Terms

Optical Character Recognition, Tesseract, Binarization, Python, Segmentation, Stroke Width Transform, OpenCV, Video Indexing, Image Processing.


The main purpose of this paper is to extend optical character recognition (OCR), which is otherwise applied only to still images, to video. Video OCR is introduced as a way to retrieve the information in a video without playing it. It is implemented with the OpenCV (cv2) module and PyTesseract [7] together with the Stroke Width Transform (SWT) technique; in combination, these extract the relevant content from a video (i.e., a lecture video or any video that shows slides or text in the background) [2,4]. The technique is carried out in a series of simple, well-defined steps that convert the information in the video into text files accurately. In addition, a speech recognition module is included in the project to complement the video and the extracted text file: the speech delivered by the instructor is also transcribed to a text file.
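
The pipeline outlined above (sample frames, run OCR, write the text out with its position in the video) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names `build_text_index`, `sample_video`, and `tesseract_ocr`, the one-second sampling interval, and the fallback frame rate are hypothetical, and the SWT text-localisation and speech-recognition stages are omitted. Only `cv2.VideoCapture`, `cv2.cvtColor`, and `pytesseract.image_to_string` are real library calls.

```python
from typing import Callable, Iterable, Iterator, List, Tuple


def build_text_index(
    frames: Iterable[Tuple[float, object]],
    ocr: Callable[[object], str],
) -> List[Tuple[float, str]]:
    """Return (timestamp_seconds, text) entries, one per change of on-screen text."""
    index: List[Tuple[float, str]] = []
    previous = ""
    for timestamp, frame in frames:
        # Collapse whitespace so minor OCR spacing differences are not treated as new text.
        text = " ".join(ocr(frame).split())
        # Skip frames with no text and frames repeating the last recorded text.
        if text and text != previous:
            index.append((timestamp, text))
            previous = text
    return index


def sample_video(path: str, every_seconds: float = 1.0) -> Iterator[Tuple[float, object]]:
    """Yield (timestamp_seconds, grayscale_frame) roughly every `every_seconds`."""
    import cv2  # imported lazily so build_text_index has no hard OpenCV dependency

    capture = cv2.VideoCapture(path)
    try:
        fps = capture.get(cv2.CAP_PROP_FPS) or 25.0  # assumed fallback if FPS is unreported
        step = max(1, int(round(fps * every_seconds)))
        frame_number = 0
        while True:
            ok, frame = capture.read()
            if not ok:
                break
            if frame_number % step == 0:
                yield frame_number / fps, cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
            frame_number += 1
    finally:
        capture.release()


def tesseract_ocr(frame: object) -> str:
    """Run Tesseract on a single frame (requires the Tesseract binary to be installed)."""
    import pytesseract

    return pytesseract.image_to_string(frame)
```

With OpenCV and Tesseract installed, a call such as `build_text_index(sample_video("lecture.mp4"), tesseract_ocr)` would return a list of timestamped text entries that can be written to a text file; the file name here is only a placeholder. Passing the OCR step in as a callable keeps the indexing logic independent of the OCR engine, so a preprocessing stage such as SWT could be added in front of it without changing the indexer.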

Cite This Paper

Paruchuru Grishman, Akula Rajitha, Mohammed Khaja Moinuddin, Mannava Subhramanaya Sreekar, Siddam Jayanth, "Implementing Video OCR along with SWT Technique for Video indexing and Analysis", International Journal of Wireless and Microwave Technologies(IJWMT), Vol.13, No.1, pp. 27-35, 2023. DOI:10.5815/ijwmt.2023.01.03


