DefinePK

DefinePK hosts the largest index of Pakistani journals, research articles, news headlines, and videos. It also offers chapter-level book search.

Audio visual tracking of a speaker based on FFT and kalman filter


Article Information

Title: Audio visual tracking of a speaker based on FFT and kalman filter

Authors: Muhammad Muzammel, Mohd Zuki Yusoff, Mohamad Naufal Mohamad Saad, Aamir Saeed Malik

Journal: ARPN Journal of Engineering and Applied Sciences

HEC Recognition History
Category From To
Y 2023-07-01 2024-09-30
Y 2022-07-01 2023-06-30
Y 2021-07-01 2022-06-30
X 2020-07-01 2021-06-30

Publisher: Khyber Medical College, Peshawar

Country: Pakistan

Year: 2016

Volume: 11

Issue: 14

Language: English

Categories

Abstract

In this paper a simple audio visual information based speaker tracking technique is proposed for indoor environment. Specifically, a Kalman filter based image processing technique is used to extract visual information, and Fast Fourier Transform (FFT) based approach is used to extract audio information for speaker tracking. Finally, a decision tree has been used to estimate the location of the speaker based on audio and visual information. One of the main advantages of the proposed technique is the use of a built-in microphone of the tracking camera; which makes this technique cost effective and simple. We have examined our method with case studies from the online SPEVI database. The proposed technique shows the best detection and works properly even when the speaker is not visible.


Paper summary is not available for this article yet.

Loading PDF...

Loading Statistics...