Automatic Infant Cry Pattern Classification for a Multiclass Problem


  • N.S.A. Wahid School of Computer and Communication Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia
  • P. Saad School of Computer and Communication Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia
  • M. Hariharan School of Mechatronic Engineering, Universiti Malaysia Perlis, 02600 Perlis, Malaysia


Artificial Neural Network, Dynamic Features, Feature Selection, Infant Cry Classification,


Crying is the only way of communication for infants to express their physical and emotional needs. Automatic infant cry analysis that provides fast and non-invasive process is suitable to assess the physical and emotional states of infants. The cry analysis provides an opportunity to understand infants’ needs. It is also beneficial in clinical environment for identifying specific pathologies through infant cry. This paper presents an automatic infant cry classification system for a multiclass problem. The cry classification system consists of three stages: (1) feature extraction, (2) feature selection, and (3) pattern classification. We extracted spectral features, such as Mel Frequency Cepstral Coefficients (MFCC) and Linear Prediction Cepstral Coefficients (LPCC) to represent the acoustic characteristics of the cry signals. In addition, the combination of spectral and dynamic features was also investigated. Due to the high dimensionality of data resulting from the feature extraction stage, we selected relevant features to perform feature selection to reduce the data dimensionality. In this stage, five different feature selection techniques were experimented. In the pattern classification stage, two Artificial Neural Network (ANN) architectures: Multilayer Perceptron (MLP) and Radial Basis Function Network (RBFN) were used for classifying the cry signals into five categories: asphyxia, pain, hunger, deaf, and normal. Experimental results show that the best classification accuracy of 93.43% (Kappa value of 0.91) was obtained from MFCC + ∆MFCC + ∆∆MFCC feature set, when using CFS selection technique and RBFN.


M. J. Carwin, B. M. Lester, and H. L. Golub, “The infant cry: what can it tell us?,” Current Problems in Pediatrics, vol. 26, no. 9, pp. 324–334,1996.

O. Wasz-Höckert, T. J. Partanen, V. Vuorenkoski, and K. Michelsson, “The identification of some specific meanings in infant vocalization,” Experientia, vol. 20, no. 3, pp. 154–154, 1964.

S. E. Barajas-Montiel and C. A. Reyes-Garcia, “Identifying pain and hunger in infant cry with classifiers ensembles,” in 2005 International Conference on Computational Intelligence for Modelling, Control and Automation, and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC’05), 2005, vol. 2, pp. 770–775.

C. Y. Chang, Y. C. Hsiao, and S. T. Chen, “Application of Incremental SVM Learning for Infant Cries Recognition,” in 2015 18th International Conference on Network-Based Information Systems, 2015, pp. 607–610.

A. Rosales-Pérez, C. A. Reyes-García, J. A. Gonzalez, O. F. ReyesGalaviz, H. J. Escalante, and S. Orlandi, “Classifying infant cry patterns by the Genetic Selection of a Fuzzy Model,” Biomedical Signal Processing and Control, vol. 17, pp. 38–46, 2015.

J. Saraswathy, M. Hariharan, W. Khairunizam, S. Yaacob, and N. Thiyagar, “Infant cry classification: time frequency analysis,” in 2013 IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 2013, pp. 499–504.

A. Zabidi, W. Mansor, L. Y. Khuan, I. M. Yassin, and R. Sahak, “Binary particle swarm optimization and F-ratio for selection of features in the recognition of asphyxiated infant cry,” in 5th European Conference of the International Federation for Medical and Biological Engineering, vol. 37, Á. Jobbágy, Ed. Springer Berlin Heidelberg, 2012, pp. 61–65.

A. Rosales-Pérez, C. A. Reyes-García, and P. Gómez-Gil, “Genetic fuzzy relational neural network for infant cry classification,” in Pattern Recognition, vol. 6718, J. F. Martínez-Trinidad, J. A. Carrasco-Ochoa, C. B.-Y. Brants, and E. R. Hancock, Eds. Springer Berlin Heidelberg, 2011, pp. 288–296.

M. Hariharan, R. Sindhu, and S. Yaacob, “Normal and hypoacoustic infant cry signal classification using time–frequency analysis and general regression neural network,” Computer methods and programs in biomedicine, vol. 108, no. 2, pp. 559–569, Nov. 2012.

J. Orozco and C. A. R. García, “Detecting pathologies from infant cry applying scaled conjugate gradient neural networks,” in European Symposium on Artificial Neural Networks, Bruges (Belgium), 2003, pp. 349–354.

J. Varallyay, Z. Benyo, A. Illenyi, Z. Farkas, and L. Kovacs, “Acoustic analysis of the infant cry: classical and new methods,” in 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2004 (IEMBS’04), 2004, pp. 313–316.

S. Orlandi, C. Manfredi, L. Bocchi, and M. L. Scattoni, “Automatic newborn cry analysis: a non-invasive tool to help autism early diagnosis,” in 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2012, pp. 2953–2956.

D. Lederman, E. Zmora, S. Hauschildt, A. Stellzig-Eisenhauer, and K. Wermke, “Classification of cries of infants with cleft-palate using parallel hidden markov models,” Medical and Biological Engineering and Computing, vol. 46, no. 10, pp. 965–975, 2008.

S. Orlandi, C. A. R. Garcia, A. Bandini, G. Donzelli, and C. Manfredi, “Application of pattern recognition techniques to the classification of full-term and preterm infant cry,” Journal of Voice, 2015.

J. Saraswathy, M. Hariharan, T. Nadarajaw, W. Khairunizam, and S. Yaacob, “Optimal selection of mother wavelet for accurate infant cry classification,” Australasian Physical and Engineering Sciences in Medicine, vol. 37, pp. 439–456, 2014.

J. O. Garcia and C. A. R. Garcia, “Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks,” in International Joint Conference on Neural Networks, 2003, 2003, pp. 3140 – 3145.

M. M. Jam and H. Sadjedi, “Wavelet-based automatic cry recognition system for detecting infants with hearing-loss from normal infants,” The Journal of Engineering, vol. 1, no. 1, pp. 1–2, 2013.

L. Liu, K. Kuo, and S. M. Kuo, “Infant cry classification integrated ANC system for infant incubators,” in 2013 10th IEEE International Conference on Networking, Sensing and Control (ICNSC), 2013, pp. 383–387.

B. Saha, P. K. Purkait, J. Mukherjee, A. K. Majumdar, B. Majumdar, and A. K. Singh, “An embedded system for automatic classification of neonatal cry,” in 2013 IEEE Point-of-Care Healthcare Technologies (PHT), 2013, pp. 248–251.

M. Hariharan, L. S. Chee, and S. Yaacob, “Analysis of infant cry through weighted linear prediction cepstral coefficients and probabilistic neural network,” Journal of Medical Systems, vol. 36, no. 3, pp. 1309–1315, Jun. 2012.

A. K. Singh, J. Mukhopadhyay, and K. S. Rao, “Classification of infant cries using epoch and spectral features,” in 2013 National Conference on Communications (NCC), 2013, pp. 1–5.

O. F. Reyes-Galaviz, S. D. Cano-Ortiz, and C. A. Reyes-García, “Evolutionary-neural system to classify infant cry units for pathologies identification in recently born babies,” in Seventh Mexican International Conference on Artificial Intelligence, 2008 (MICAI ’08), 2008, pp. 330–335.

R. Sahak, W. Mansor, Y. K. Lee, A. I. M. Yassin, and A. Zabidi, “Orthogonal least square based support vector machine for the classification of infant cry with asphyxia,” in 2010 3rd International Conference on Biomedical Engineering and Informatics (BMEI), 2010, vol. 3, pp. 986–990.

A. Zabidi, W. Mansor, L. Y. Khuan, I. M. Yassin, and R. Sahak, “The effect of f-ratio in the classification of asphyxiated infant cries using multilayer perceptron neural network,” in 2010 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES), 2010, pp. 126–129.

M. Hariharan, S. Yaacob, and S. A. Awang, “Pathological infant cry analysis using wavelet packet transform and probabilistic neural network,” Expert Systems with Applications, vol. 38, no. 12, pp. 15377–15382, Nov. 2011.

R. Cohen and Y. Lavner, “Infant cry analysis and detection,” in 2012 IEEE 27th Convention of Electrical & Electronics Engineers in Israel (IEEEI), 2012, pp. 1–5.

L. Liu, J. He, and G. Palm, “Signal modeling for speaker identification,” in 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1996. ICASSP-96, 1996, vol. 2, no. 3, pp. 665–668.

L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. New Jersey: Prentice Hall, 1993.

T. Kinnunen, V. Hautamäki, and P. Fränti, “Fusion of spectral feature sets for accurate speaker identification,” in 9th Conference Speech and Computer (SPECOM-2004), 2004, pp. 361–365.

J. Harrington and S. Cassidy, Techniques in Speech Acoustics. Dordrecht: Kluwer Academic Publishers, 1999.

B. S. Atal, “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification,” Journal of the Acoustical Society of America, vol. 55, no. 6, pp. 1304–1312, 1974.

J. S. Mason and X. Zhang, “Velocity and acceleration features in speaker recognition,” in 1991 International Conference on Acoustics, Speech, and Signal Processing, 1991. ICASSP-91, 1991, pp. 3673–3676.

T. Kinnunen, “Spectral features for automatic text-independent speaker recognition (Licentiate‘s thesis),” University of Joensuu, 2003.

L. Yu and H. Liu, “Efficient Feature Selection via Analysis of Relevance and Redundancy,” Journal of Machine Learning Research, vol. 5, pp. 1205–1224, 2004.

M. Dash and H. Liu, “Consistency-based search in feature selection,” Artificial Intelligence, vol. 151, no. 1, pp. 155–176, 2003.

A. L. Blum and P. Langley, “Selection of relevant features and examples in machine learning,” Artificial Intelligence, vol. 97, no. 1–2, pp. 245–271, 1997.

R. C. Holte, “Very simple classification rules perform well on most commonly used datasets,” Machine Learning, vol. 11, no. 1, pp. 63–91, 1993.

I. Kononenko, “Estimating Attributes: Analysis and Extensions of Relief,” in Machine Learning: ECML-94, Springer Berlin Heidelberg, 1994.

L. Yu and H. Liu, “Feature selection for high-dimensional data: A fast correlation-based filter solution,” in International Conference on Machine Learning (ICML), 2003, pp. 856–863.

W. H. Press, B. P. Flannery, S. A. Teukolsky, and W. T. Vetterling, Numerical recipes in C, 1st ed. Cambridge: Cambridge University Press, 1988.

H. Liu and R. Setiono, “A probabilistic approach to feature selection - a filter solution,” in Proc 13th International Conference on Machine Learning, 1996, vol. 96, pp. 319–327.

D. E. Golberg, Genetic algorithms in search, optimization, and machine learning. Addion wesley, 1989, 1989.

M. A. Hall, “Correlation-based feature selection for machine learning,” Waikato University, New Zealand, 1999.

S. S. Chiddarwar and N. R. Babu, “Comparison of RBF and MLP neural networks to solve inverse kinematic problem for 6R serial robot by a fusion approach,” Engineering Applications of Artificial Intelligence, vol. 23, no. 7, pp. 1083–1092, 2010.

D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning internal representations by error propagation,” in Parallel Distributed Processing, D. E. Rumelhart and J. L. McClelland, Eds. Cambridge MA: MIT Press, 1986, pp. 318–362.

J. Moody and C. J. Darken, “Fast learning in networks of locally-tuned processing units,” Neural Computation, vol. 1, no. 2, pp. 281–294, 1998.

I. H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed. San Francisco: Morgan Kaufmann, 2005.

M. Hall and G. Holmes, “Benchmarking attribute selection techniques for data mining,” Knowledge and Data Engineering, IEEE Transactions on, vol. 15, no. 6, pp. 1437–1447, 2003.

J. Cohen, “A coefficient of agreement for nominal scales,” Educat. Psychol. Meas., vol. 20, no. 1, pp. 37–46, 1960.

A. Ozcift and A. Gulten, “A robust multi-class feature selection strategy based on rotation forest ensemble algorithm for diagnosis of erythematosquamous diseases,” Journal of Medical Systems, vol. 36, no. 2, pp. 941–949, 2012.

M. Galar, A. Fernandez, E. Barrenechea, and F. Herrera, “DRCW-OVO: Distance-based relative competence weighting combination for One-vsOne strategy in multi-class problems,” Pattern Recognition, vol. 48, no. 1, pp. 28–42, 2015.

I. Guyon and A. Elisseeff, “An Introduction to Variable and Feature Selection,” The Journal of Machine Learning Research, vol. 3, pp. 1157–1182, 2003.




How to Cite

Wahid, N., Saad, P., & Hariharan, M. (2016). Automatic Infant Cry Pattern Classification for a Multiclass Problem. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 8(9), 45–52. Retrieved from