International Journal of Scientific and Research Publications

IJSRP, Volume 2, Issue 9, September 2012 Edition [ISSN 2250-3153]


Dialect variation in Boro Language and Grapheme-to-Phoneme conversion rules to handle lexical lookup fails in Boro TTS System
      Chandan Sarma, Prof. P.H Talukdar
Abstract: It is not possible to include all the words in a natural language for general text-to-speech system. Grapheme-to-phoneme conversion system is essential to pronounce a word which is out of vocabulary. Grapheme-to-phoneme rules play a vital role where lexical lookup fails. Though basic Grapheme-to-phoneme rules system is very simple yet it is very powerful for naturalness of a TTS system. Letter-to-sound rules may be hand written or maybe automatic depending on the language. We have worked on Bodo language. After a systematic study of Boro language we found that there is a systematic relationship between the written form of a Bodo word and its pronunciation. So, it is fairly easy to build letter-to-sound rules by hand for Bodo language. We have used a Bodo corpora of 5000 words and built letter-to-sound rules. These rules have been tested using Festival, a most popular speech synthesizer and applying these rules, we were able to produce correct pronunciations for approximately 89% of the words. Again, dialect variation also influences grapheme-to-phoneme conversion rules. This paper gives overview of Boro dialect variation and grapheme-to-phoneme conversion rules developed for Boro TTS system.

Reference this Research Paper (copy & paste below code):

Chandan Sarma, Prof. P.H Talukdar (2018); Dialect variation in Boro Language and Grapheme-to-Phoneme conversion rules to handle lexical lookup fails in Boro TTS System; Int J Sci Res Publ 2(9) (ISSN: 2250-3153). http://www.ijsrp.org/research-paper-0912.php?rp=P09214
©️ Copyright 2011-2022 IJSRP - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.