GSMIS

ข้อมูลการเผยแพร่ผลงาน

การเผยแพร่ในรูปของบทความวารสารทางวิชาการ

ชื่อบทความ

Isarn Dialect Speech Synthesis using HMM with syllable-context features

วัน/เดือน/ปี ที่ได้ตอบรับ

2 พฤศจิกายน 2561

วารสาร

ชื่อวารสาร

ECTI Transactions on Computer and Information Technology (ECTI-CIT)

มาตรฐานของวารสาร

SCOPUS

หน่วยงานเจ้าของวารสาร

The Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI) Association

ISBN/ISSN

ปีที่

ฉบับที่

เดือน

November

ปี พ.ศ. ที่พิมพ์

2561

หน้า

81-89

บทคัดย่อ

This paper describes the Isarn speech synthesis system, which is a regional dialect spoken in the Northeast of Thailand. In this study, we focus to improve the prosody generation of the system by using the additional context features. In order to develop the system, the speech parameters (Mel-ceptrum and fundamental frequencies of phoneme within different phonetic contexts) were modelled using Hidden Markov Models (HMM). Synthetic speech was generated by converting the input text into context-dependent phonemes. Speech parameters were generated from the trained HMM, according to the context-dependent phonemes, and were then synthesized through a speech vocoder. In this study, systems were trained using three different feature sets: basic contextual features, tonal, and syllable-context features. Objective and subjective tests were conducted to determine the performance of the proposed system. The results indicated that the addition of the syllable-context features significantly improved the naturalness of synthesized speech.

คำสำคัญ

Text-to-speech, speech synthesis, HMM, Isarn

ผู้เขียน

577020065-1	นาย พงษ์ศธร จันทร์ยอย [ผู้เขียนหลัก]
	คณะวิทยาศาสตร์ ปริญญาเอก ภาษาอังกฤษ

การประเมินบทความ

มีผู้ประเมินอิสระ

สถานภาพการเผยแพร่

ตีพิมพ์แล้ว

วารสารมีการเผยแพร่ในระดับ

นานาชาติ

citation

ไม่มี

เป็นส่วนหนึ่งของวิทยานิพนธ์

เป็น

ใช้สำหรับสำเร็จการศึกษา

ไม่เป็น

แนบไฟล์

Citation