Thai Legal Term Correction using Random Forests with Outside-the-sentence Features

Takahiro Yamakoshi, Vee Satayamas, Hutchatai Chanlekha,Yasuhiro Ogawa,Takahiro Komamizu,Asanee Kawtrakul,Katsuhiko Toyama

semanticscholar(2019)

引用 0|浏览0
暂无评分
摘要
We propose a method for finding and correcting misused Thai legal terms in Thai statutory sentences. Our method predicts legal terms using Random Forest classifiers, each of which is optimized for each set of similar legal terms. Each classifier utilizes outsidethe-sentence features, namely, promulgation year, title keywords, and section keywords of statutes, in addition to words adjacent to the targeted legal term. Our experiment shows that our method outperformed not only a Random Forest method without the outside-thesentence features, but also BERT (Bidirectional Encoder Representations from Transformers), a powerful language representation model, in overall accuracy.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要