Acoustic Analysis of Nigerian English Vowels Based on Accents

Authors

  • A F Atanda Institute of Advance & Smart Digital Opportunities (IASDO). School of Computing Universiti Utara Malaysia (UUM), Sintok, Kedah, Malaysia.
  • S A Mohd Yusof School of Computing Universiti Utara Malaysia (UUM), Sintok, Kedah, Malaysia.
  • H Husni School of Computing Universiti Utara Malaysia (UUM), Sintok, Kedah, Malaysia.

Keywords:

Accent Recognition, Acoustic Analysis, Automatic Speech Recognition, Formant Analysis, Nigeria English,

Abstract

Accent has been widely acclaimed to be a major source of automatic speech recognition (ASR) performance degradation. Most ASR applications were developed with native English speaker speech samples not minding the fact that the majority of its potential users speaks English as a second language with a marked accent. Nigeria like most nations colonized by Britain, speaks English as official language despite being a multi-ethnic nation. This work explores the acoustic features of energy, fundamental frequency and the first three formats of the three major ethnic groups of Nigerian based on features extracted from five pure vowels of English obtained from subjects who are Nigerians. This research aimed at determining the differences or otherwise between the pronunciations of the three major ethnic nationalities in Nigeria to aid the development of ASR that is robust to NE accent. The results show that there exist significant differences between the mean values of the pure English vowels based on the pronunciation of the three major ethnics: Hausa, Ibo, and Yoruba. The differences can be explored to enhance the performance of ASR in recognition of NE.

References

Furui S. Fifty years of progress in speech and speaker recognition. The Journal of the Acoustical Society of America. 2004;116:2497.

Hariharan M, Chee LS, Ai OC, Yaacob S. Classification of speech dysfluencies using LPC based parameterization techniques. Journal of medical systems. 2012;36:1821-30.

Huang C-L, Wu C-H. Spoken document retrieval using multilevel knowledge and semantic verification. Audio, Speech, and Language Processing, IEEE Transactions on. 2007;15:2551-60.

Crystal D. English as a global language: Cambridge University Press; 2012.

Foley J. English in new cultural contexts: Reflections from Singapore: Oxford University Press; 1998.

Sharbawi SH. An acoustic investigation of the segmental features of educated Brunei English speech, 2010.

Gut U. Nigerian English: Phonology. A handbook of varieties of English. 2004;1:992-1002.

Olaniyi OK. The taxonomy of Nigerian varieties of spoken English. International Journal of English and Literature. 2014;5(9):232-40.

Scharenborg O, Cooke M, editors. Comparing human and machine recognition performance on a VCV corpus. Proc Workshop on Speech Analysis and Processing for Knowledge Discovery; 2008.

You H, Adviser-Alwan A. Robust automatic speech recognition algorithms for dealing with noise and accent: University of California at Los Angeles; 2009.

Lippmann RP., Speech recognition by machines and humans. Speech Communication. 1997;22:1-15.

Stern RM, Morgan N., Hearing Is Believing: Biologically-‐Inspired Feature Extraction For Robust Automatic Speech Recognition. 2012.

Faria A., Accent classification for speech recognition. Machine Learning for Multimodal Interaction: Springer; 2006. p. 285-93.

Hanani A, Russell MJ, Carey MJ., Human and computer recognition of regional accents and ethnic groups from British English speech. Computer Speech & Language. 2013;27:59-74.

Vergyri D, Lamel L, Gauvain J-L, editors., Automatic speech recognition of multiple accented English data. INTERSPEECH; 2010.

Amuda SAY, Boril H, Sangwan A, Ibiyemi TS, Hansen JH. Engineering Analysis and Recognition of Nigerian English: An Insight into Low Resource Languages. Transactions on Machine Learning and Artificial Intelligence. 2014;2(4):115-28.

Gaikwad S, Gawali B, Kale K., Accent Recognition for Indian English using Acoustic Feature Approach. International Journal of Computer Applications. 2013;63(7).

Yusnita M, Paulraj MP, Yaacob S, Bakar SA, Saidatul A, editors., Malaysian English accents identification using LPC and formant analysis. Control System, Computing and Engineering (ICCSCE), 2011 IEEE International Conference on; 2011: IEEE.

Azmi MS., Development of Malay Word Pronunciation Application using Vowel Recognition. Malay. 2016;9(1).

Yusnita M., Investigation of Robust Speech Feature Extraction Techniques for Accents Classification of Malaysian English Speakers [Ph.D]: Universiti Malaysia Perlis; 2014.

Evans J, Chu M-n, Aston JA, Su C-y., Linguistic and human effects on F0 in a tonal dialect of Qiang. Phonetica. 2010;67:82-99.

Ghai W, Singh N., Literature Review on Automatic Speech Recognition. International Journal of Computer Applications. 2012;41(8):42-50.

Hao Y-C., Second language acquisition of Mandarin Chinese tones by tonal and non-tonal language speakers. Journal of phonetics. 2012;40:269-79.

Jurafsky D, Martin JH., Speech and language processing: Pearson; 2014.

Shrawankar U, Thakare VM., Techniques for feature extraction in speech recognition system: a comparative study. arXiv preprint arXiv:13051145. 2013.

Arslan L, Hansen J., A study of temporal features and frequency characteristics in American English foreign accent. The Journal of the Acoustical Society of America. 1997;102(1):28-40.

Rabiner LR, Schafer RW., Theory and application of digital speech processing. Preliminary Edition. 2009.

Downloads

Published

2017-11-30

How to Cite

Atanda, A. F., Mohd Yusof, S. A., & Husni, H. (2017). Acoustic Analysis of Nigerian English Vowels Based on Accents. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 9(3-7), 13–20. Retrieved from https://jtec.utem.edu.my/jtec/article/view/3059