Automatic Phoneme Identification for Malay Dialects
Keywords: Phoneme Identification, Malay Dialect, Multilingual, Text Transcript
Abstract: In many languages, such as English, French, German, and Mandarin, the pronunciation of words is well documented. The pronunciation of a word is determined by its sequence of phonemes, or speech sounds, and each language or dialect may have a different phoneme set. For dialects, however, phonological studies are often lacking, and for some dialects or languages without a written form even the number of phonemes is unknown. In this work, we propose an approach that identifies the phonemes of a dialect from a text transcript and a speech corpus of that dialect, leveraging existing resources for the standard language together with multilingual resources. Our study was carried out on Malay dialects. The results show that the proposed approach identifies phonemes with high accuracy compared with previous work in the area.
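To make the notion of a dialect-specific phoneme set concrete, the following is a minimal sketch in Python. It is not the identification method proposed in the paper: it only shows how a pronunciation lexicon maps words to phoneme sequences and how two phoneme inventories can be compared. The lexicon entries, phoneme symbols, and function names are illustrative assumptions and are not data or code from the study.

```python
from typing import Dict, List, Set

# Toy pronunciation lexicons: word -> phoneme sequence.
# All entries are illustrative assumptions, not data from the paper.
STANDARD_LEXICON: Dict[str, List[str]] = {
    "makan": ["m", "a", "k", "a", "n"],
    "ikan": ["i", "k", "a", "n"],
}
DIALECT_LEXICON: Dict[str, List[str]] = {
    "makan": ["m", "a", "k", "ɛ"],
    "ikan": ["i", "k", "ɛ"],
}


def phoneme_set(lexicon: Dict[str, List[str]]) -> Set[str]:
    """Collect every distinct phoneme used in a lexicon."""
    return {p for phones in lexicon.values() for p in phones}


def inventory_difference(
    standard: Dict[str, List[str]], dialect: Dict[str, List[str]]
) -> Dict[str, Set[str]]:
    """Return the phonemes that appear in only one of the two lexicons."""
    std, dia = phoneme_set(standard), phoneme_set(dialect)
    return {"dialect_only": dia - std, "standard_only": std - dia}


diff = inventory_difference(STANDARD_LEXICON, DIALECT_LEXICON)
print(sorted(diff["dialect_only"]))
print(sorted(diff["standard_only"]))
```

In practice the dialect lexicon is exactly what is missing, which is why the abstract proposes deriving the phonemes from the dialect transcript and speech corpus instead of assuming a lexicon like the toy one above.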
Journal: Journal of Telecommunication, Electronic and Computer Engineering (JTEC)