Automatic Infant Cry Pattern Classification for a Multiclass Problem

Authors

  • N.S.A. Wahid School of Computer and Communication Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia
  • P. Saad School of Computer and Communication Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia
  • M. Hariharan School of Mechatronic Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia

Keywords:

Artificial Neural Network, Dynamic Features, Feature Selection, Infant Cry Classification,

Abstract

Crying is the only way of communication for infants to express their physical and emotional needs. Automatic infant cry analysis that provides fast and non-invasive process is suitable to assess the physical and emotional states of infants. The cry analysis provides an opportunity to understand infants’ needs. It is also beneficial in clinical environment for identifying specific pathologies through infant cry. This paper presents an automatic infant cry classification system for a multiclass problem. The cry classification system consists of three stages: (1) feature extraction, (2) feature selection, and (3) pattern classification. We extracted spectral features, such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) to represent the acoustic characteristics of the cry signals. In addition, the combination of spectral and dynamic features was also investigated. Due to the high dimensionality of data resulting from the feature extraction stage, we selected relevant features to perform feature selection to reduce the data dimensionality. In this stage, five different feature selection techniques were experimented. In the pattern classification stage, two Artificial Neural Network (ANN) architectures: Multilayer Perceptron (MLP) and Radial Basis Function Network (RBFN) were used for classifying the cry signals into five categories: asphyxia, pain, hunger, deaf, and normal. Experimental results show that the best classification accuracy of 93.43% (Kappa value of 0.91) was obtained from MFCC + ∆MFCC + ∆∆MFCC feature set, when using CFS selection technique and RBFN.

Downloads

Download data is not yet available.

Downloads

Published

2016-12-01

How to Cite

Wahid, N., Saad, P., & Hariharan, M. (2016). Automatic Infant Cry Pattern Classification for a Multiclass Problem. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 8(9), 45–52. Retrieved from https://jtec.utem.edu.my/jtec/article/view/838