Acoustic Analysis of Nigerian English Vowels Based on Accents
Keywords:Accent Recognition, Acoustic Analysis, Automatic Speech Recognition, Formant Analysis, Nigeria English,
AbstractAccent has been widely acclaimed to be a major source of automatic speech recognition (ASR) performance degradation. Most ASR applications were developed with native English speaker speech samples not minding the fact that the majority of its potential users speaks English as a second language with a marked accent. Nigeria like most nations colonized by Britain, speaks English as official language despite being a multi-ethnic nation. This work explores the acoustic features of energy, fundamental frequency and the first three formats of the three major ethnic groups of Nigerian based on features extracted from five pure vowels of English obtained from subjects who are Nigerians. This research aimed at determining the differences or otherwise between the pronunciations of the three major ethnic nationalities in Nigeria to aid the development of ASR that is robust to NE accent. The results show that there exist significant differences between the mean values of the pure English vowels based on the pronunciation of the three major ethnics: Hausa, Ibo, and Yoruba. The differences can be explored to enhance the performance of ASR in recognition of NE.
Furui S. Fifty years of progress in speech and speaker recognition. The Journal of the Acoustical Society of America. 2004;116:2497.
Hariharan M, Chee LS, Ai OC, Yaacob S. Classification of speech dysfluencies using LPC based parameterization techniques. Journal of medical systems. 2012;36:1821-30.
Huang C-L, Wu C-H. Spoken document retrieval using multilevel knowledge and semantic verification. Audio, Speech, and Language Processing, IEEE Transactions on. 2007;15:2551-60.
Crystal D. English as a global language: Cambridge University Press; 2012.
Foley J. English in new cultural contexts: Reflections from Singapore: Oxford University Press; 1998.
Sharbawi SH. An acoustic investigation of the segmental features of educated Brunei English speech, 2010.
Gut U. Nigerian English: Phonology. A handbook of varieties of English. 2004;1:992-1002.
Olaniyi OK. The taxonomy of Nigerian varieties of spoken English. International Journal of English and Literature. 2014;5(9):232-40.
Scharenborg O, Cooke M, editors. Comparing human and machine recognition performance on a VCV corpus. Proc Workshop on Speech Analysis and Processing for Knowledge Discovery; 2008.
You H, Adviser-Alwan A. Robust automatic speech recognition algorithms for dealing with noise and accent: University of California at Los Angeles; 2009.
Lippmann RP., Speech recognition by machines and humans. Speech Communication. 1997;22:1-15.
Stern RM, Morgan N., Hearing Is Believing: Biologically-‐Inspired Feature Extraction For Robust Automatic Speech Recognition. 2012.
Faria A., Accent classification for speech recognition. Machine Learning for Multimodal Interaction: Springer; 2006. p. 285-93.
Hanani A, Russell MJ, Carey MJ., Human and computer recognition of regional accents and ethnic groups from British English speech. Computer Speech & Language. 2013;27:59-74.
Vergyri D, Lamel L, Gauvain J-L, editors., Automatic speech recognition of multiple accented English data. INTERSPEECH; 2010.
Amuda SAY, Boril H, Sangwan A, Ibiyemi TS, Hansen JH. Engineering Analysis and Recognition of Nigerian English: An Insight into Low Resource Languages. Transactions on Machine Learning and Artificial Intelligence. 2014;2(4):115-28.
Gaikwad S, Gawali B, Kale K., Accent Recognition for Indian English using Acoustic Feature Approach. International Journal of Computer Applications. 2013;63(7).
Yusnita M, Paulraj MP, Yaacob S, Bakar SA, Saidatul A, editors., Malaysian English accents identification using LPC and formant analysis. Control System, Computing and Engineering (ICCSCE), 2011 IEEE International Conference on; 2011: IEEE.
Azmi MS., Development of Malay Word Pronunciation Application using Vowel Recognition. Malay. 2016;9(1).
Yusnita M., Investigation of Robust Speech Feature Extraction Techniques for Accents Classification of Malaysian English Speakers [Ph.D]: Universiti Malaysia Perlis; 2014.
Evans J, Chu M-n, Aston JA, Su C-y., Linguistic and human effects on F0 in a tonal dialect of Qiang. Phonetica. 2010;67:82-99.
Ghai W, Singh N., Literature Review on Automatic Speech Recognition. International Journal of Computer Applications. 2012;41(8):42-50.
Hao Y-C., Second language acquisition of Mandarin Chinese tones by tonal and non-tonal language speakers. Journal of phonetics. 2012;40:269-79.
Jurafsky D, Martin JH., Speech and language processing: Pearson; 2014.
Shrawankar U, Thakare VM., Techniques for feature extraction in speech recognition system: a comparative study. arXiv preprint arXiv:13051145. 2013.
Arslan L, Hansen J., A study of temporal features and frequency characteristics in American English foreign accent. The Journal of the Acoustical Society of America. 1997;102(1):28-40.
Rabiner LR, Schafer RW., Theory and application of digital speech processing. Preliminary Edition. 2009.
How to Cite
TRANSFER OF COPYRIGHT AGREEMENT
The manuscript is herewith submitted for publication in the Journal of Telecommunication, Electronic and Computer Engineering (JTEC). It has not been published before, and it is not under consideration for publication in any other journals. It contains no material that is scandalous, obscene, libelous or otherwise contrary to law. When the manuscript is accepted for publication, I, as the author, hereby agree to transfer to JTEC, all rights including those pertaining to electronic forms and transmissions, under existing copyright laws, except for the following, which the author(s) specifically retain(s):
- All proprietary right other than copyright, such as patent rights
- The right to make further copies of all or part of the published article for my use in classroom teaching
- The right to reuse all or part of this manuscript in a compilation of my own works or in a textbook of which I am the author; and
- The right to make copies of the published work for internal distribution within the institution that employs me
I agree that copies made under these circumstances will continue to carry the copyright notice that appears in the original published work. I agree to inform my co-authors, if any, of the above terms. I certify that I have obtained written permission for the use of text, tables, and/or illustrations from any copyrighted source(s), and I agree to supply such written permission(s) to JTEC upon request.