|
Publication
|
Research Title |
Isarn Dialect Word Segmentation using Bi-directional Gated Recurrent Unit with transfer learning approach |
Date of Distribution |
22 December 2022 |
Conference |
Title of the Conference |
The International Computer Science and Engineering Conference 2022 (ICSEC) |
Organiser |
IEEE Thailand Section, IEEE ComSoc Thailand Chapter, and ECTI Association Thailand |
Conference Place |
Kasetsart University, Chalermphrakiat Sakon Nakhon Province Campus |
Province/State |
Sakon Nakhon |
Conference Date |
21 December 2022 |
To |
23 December 2022 |
Proceeding Paper |
Volume |
- |
Issue |
- |
Page |
156 |
Editors/edition/publisher |
IEEE Xplore |
Abstract |
This paper presents an Isarn dialect word segmentation based on a recurrent neural network. In this study, the Isarn text written in Thai script is taken as input. We explored the effectiveness of the types of recurrent layers; recurrent neural networks (RNN), gated recurrent units (GRU), and long short-term memory (LSTM). The F1-scores of RNN, GRU, and LSTM are 95.36, 96.05, and 95.70, respectively. The experiment results showed that using GRU as the recurrent layer achieved the best performance. To deal with borrowed words from Thai, transfer learning was applied to improve the performance of the model by fine-tuning the pre-trained model given the limited size of the Isarn corpus. The model trained through the transfer learning approach outperformed the model trained from the Isarn dataset alone. |
Author |
|
Peer Review Status |
มีผู้ประเมินอิสระ |
Level of Conference |
นานาชาติ |
Type of Proceeding |
Full paper |
Type of Presentation |
Oral |
Part of thesis |
true |
Presentation awarding |
false |
Attach file |
|
Citation |
0
|
|
|
|
|
|
|