Mi DSpace
Please use this identifier to cite or link to this item: http://hdl.handle.net/UCSP/15835
Title: Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features
Authors: Ludeña Choez, Jimmy
Gallardo Antolín, Ascensión
metadata.dc.contributor.advisor: https://purl.org/pe-repo/ocde/ford#2.02.01
Keywords: Extraction;Factorization;Feature extraction;Frequency bands;Integration;Matrix algebra;Speech recognition;Support vector machines;Acoustic event classification;Feature extraction methods;Mel frequency cepstrum coefficients;Mutual informations;Nonnegative matrix factorization;Parametric representations;Spectral representations;Temporal feature integrations;Classification (of information)
Issue Date: 2016
Publisher: Elsevier Ltd
metadata.dc.relation.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-84946616129&doi=10.1016%2fj.eswa.2015.10.018&partnerID=40&md5=99b38cb7420d4d402bdc179cb63fec7d
Abstract: Feature extraction methods for sound events have been traditionally based on parametric representations specifically developed for speech signals, such as the well-known Mel Frequency Cepstrum Coefficients (MFCC). However, the discrimination capabilities of these features for Acoustic Event Classification (AEC) tasks could be enhanced by taking into account the spectro-temporal structure of acoustic event signals. In this paper, a new front-end for AEC which incorporates this specific information is proposed. It consists of two different stages: short-time feature extraction and temporal feature integration. The first module aims at providing a better spectral representation of the different acoustic events on a frame-by-frame basis, by means of the automatic selection of the optimal set of frequency bands from which cepstral-like features are extracted. The second stage is designed for capturing the most relevant temporal information in the short-time features, through the application of Non-Negative Matrix Factorization (NMF) on their periodograms computed over long audio segments. The whole front-end has been evaluated in clean and noisy conditions. Experiments show that the removal of certain frequency bands (which are mainly located in the medium region of the spectrum for clean conditions and in low frequencies for noisy environments) in the short-time feature computation process in conjunction with the NMF technique for temporal feature integration improves significantly the performance of a Support Vector Machine (SVM) based AEC system with respect to the use of conventional MFCCs. © 2015 Elsevier Ltd. All rights reserved.
URI: http://repositorio.ucsp.edu.pe/handle/UCSP/15835
ISSN: 9574174
Appears in Collections:Artículos - Ingeniería Electrónica y de Telecomunicaciones

Files in This Item:
There are no files associated with this item.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.