Detection of Non-Speech Human Sounds for Surveillance

The objective of this research is to develop feature extraction and classification techniques for Acoustic Event Detection (AED) in unstructured environments, that is, environments where adverse effects such as noise, distortion and multiple sources are likely to occur. The aim is to design a system that achieves human-like sound recognition performance across a range of hearing tasks and conditions. The research is important because the field is commonly overshadowed by the more popular area of Automatic Speech Recognition (ASR), and typical AED systems are often based on techniques taken directly from ASR. However, applying these techniques directly presents difficulties: the characteristics of acoustic events are less well defined than those of speech, and there is no sub-word dictionary for acoustic events comparable to the phonemes of speech. It is therefore worthwhile to develop a system that performs well on this challenging task.

Keywords: Acoustic event detection, Feature extraction, Classification, Deep belief networks.


