Environment Recognition for Digital Audio Forensics Using MPEG-7 and Mel Cepstral Features

Environment Recognition for Digital Audio Forensics Using MPEG-7 and Mel
 Cepstral Features


Ghulam Muhammad1, 2 and Khaled Alghathbar1
1Center of Excellence in Information Assurance, King Saud University, Saudi Arabia
2Department of Computer Engineering, King Saud University, Saudi Arabia

Abstract:
Environment recognition from digital audio for forensics application is a growing area of interest. However, compared to other branches of audio forensics, it is a less researched one. Especially less attention has been given to detect environment from files where foreground speech is present, which is a forensics scenario. In this paper, we perform several experiments focusing on the problems of environment recognition from audio particularly for forensics application. Experimental results show that the task is easier when audio files contain only environmental sound than when they contain both foreground speech and background environment. We propose a full set of MPEG-7 audio features combined with Mel Frequency Cepstral Coefficients (MFCCs) to improve the accuracy. In the experiments, the proposed approach significantly increases the recognition accuracy of environment sound even in the presence of high amount of foreground human speech.


Keywords: Audio forensics, environment recognition, MPEG-7 audio, MFCC.

 
Received March 8, 2010; accepted October 24, 2010
Read 3233 times Last modified on Tuesday, 14 May 2019 03:49
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…