DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

Efficient Region-Based Video Text Extraction Using Advanced Detection and Recognition Models


Article Information

Title: Efficient Region-Based Video Text Extraction Using Advanced Detection and Recognition Models

Authors: Naveed Ahmed, Zahid Iqbal, Abdullah Nawaz, Huah Yong Chan, Fatima N. AL-Aswadi, Hafiz Usman Zia

Journal: International Journal of Innovations in Science & Technology

HEC Recognition History
Category From To
Y 2024-10-01 2025-12-31
Y 2023-07-01 2024-09-30
Y 2021-07-01 2022-06-30

Publisher: 50SEA JOURNALS (SMC-PRIVATE) LIMITED

Country: Pakistan

Year: 2025

Volume: 7

Issue: 5

Language: en

Keywords: optical character recognitionDeep Learning in Linguisticsvideo analysisScene Text DetectionScene Text Recognition

Categories

Abstract

This paper presents an automated process for extracting text from video frames by specifically targeting text-rich regions, identified through advanced scene text detection methods. Unlike traditional techniques that apply OCR to entire frames—resulting in excessive computations and higher error rates—our approach focuses only on textual areas, improving both speed and accuracy. The system integrates effective preprocessing routines, cutting-edge text detectors (CRAFT, DBNet), and advanced recognition engines (CRNN, transformer-based) within a unified framework. Extensive testing on datasets such as ICDAR 2015, ICDAR 2017 MLT, and COCO-Text demonstrates consistent gains in F-scores and word recognition rates, significantly outperforming baseline methods. Additionally, detailed error analysis, ablation studies, and runtime evaluations offer deeper insights into the strengths and limitations of the proposed method. This pipeline is particularly useful for tasks like video indexing, semantic retrieval, and real-time multimedia analysis.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...