Accent Recognition for Noisy Audio Signals

Zichen Ma; Ernest Fokoué

doi:10.55630/sjc.2014.8.169-182

Authors

Zichen Ma, Center for Quality and Applied Statistics, Rochester Institute of Technology, 98 Lomb Memorial Drive, Rochester, NY 14623, USA
Ernest Fokoué, Center for Quality and Applied Statistics, Rochester Institute of Technology, 98 Lomb Memorial Drive, Rochester, NY 14623, USA

DOI:

https://doi.org/10.55630/sjc.2014.8.169-182

Keywords:

Ill-Posed Problem, Feature Extraction, Mel-Frequency Cepstral Coefficients, Discriminant Analysis, Support Vector Machine, K-Nearest Neighbors, Autoregressive Noise

Abstract

It is well established that accent recognition can reach an accuracy
of up to 95% when the signals are noise-free, using feature extraction techniques
such as mel-frequency cepstral coefficients and binary classifiers such as discriminant
analysis, support vector machines and k-nearest neighbors. In this paper, we demonstrate
that predictive performance can drop by as much as 15 percentage points when the signals are noisy.
Specifically, we perturb the signals with different levels of white noise,
and as the noise becomes stronger, the out-of-sample predictive performance deteriorates
from 95% to 80%, although the in-sample prediction gives overly optimistic results.
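
The perturbation step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the noise levels were defined, so the signal-to-noise ratio (SNR) parameterization, the function name `add_white_noise`, the chosen SNR values, and the synthetic sine-wave stand-in for an audio clip are all assumptions made for illustration, not the authors' procedure.

```python
import numpy as np


def add_white_noise(signal, snr_db, rng=None):
    """Return `signal` plus zero-mean Gaussian white noise at roughly `snr_db` dB.

    Hypothetical helper: the SNR-based definition of "noise level" is an
    assumption, not something stated in the abstract.
    """
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=float)
    # Average power of the clean signal.
    signal_power = np.mean(signal ** 2)
    # Noise power that yields the requested SNR in decibels.
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise


# Synthetic stand-in for a one-second clip sampled at 16 kHz (assumed values).
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 16000, endpoint=False)
clean = np.sin(2 * np.pi * 220.0 * t)

# Progressively stronger noise: lower SNR means a noisier signal.
noisy_versions = {snr: add_white_noise(clean, snr, rng) for snr in (30, 20, 10, 0)}
```

In a study of this kind, each noisy version would then go through feature extraction and classification, with accuracy measured on held-out data, since in-sample accuracy can look better than the out-of-sample figure.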
