Accent Recognition for Noisy Audio Signals

Authors

  • Zichen Ma Center for Quality and Applied Statistics Rochester Institute of Technology 98 Lomb Memorial Drive, Rochester NY 14623, USA
  • Ernest Fokoué Center for Quality and Applied Statistics Rochester Institute of Technology 98 Lomb Memorial Drive, Rochester NY 14623, USA

DOI:

https://doi.org/10.55630/sjc.2014.8.169-182

Keywords:

Ill-Posed Problem, Feature Extraction, Mel-Frequency Cepstral Coefficients, Discriminant Analysis, Support Vector Machine, K-Nearest Neighbors, Autoregressive Noise

Abstract

It is well established that accent recognition can be as accurate
as up to 95% when the signals are noise-free, using feature extraction techniques
such as mel-frequency cepstral coefficients and binary classifiers such as discriminant
analysis, support vector machine and k-nearest neighbors. In this paper, we demonstrate
that the predictive performance can be reduced by as much as 15% when the signals are noisy.
Specifically, in this paper we perturb the signals with different levels of white noise,
and as the noise become stronger, the out-of-sample predictive performance deteriorates
from 95% to 80%, although the in-sample prediction gives overly-optimistic results.

Downloads

Published

2015-04-07

Issue

Section

Articles