Word Boundary Identification for Myanmar Text Using Conditional Random Fields

机译：使用条件随机字段的缅甸文本的字边界识别

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper examines the effectiveness of conditional random fields (CRFs) when used to identify Myanmar word boundaries within a supervised framework. Existing approaches are based on the method of maximum matching which appears to suffer from problems relating to the manner in which Myanmar words are composed. In our experiments, the CRF approach is compared against a baseline based on maximum matching using dictionaries from the Myanmar Language Commission Dictionary (word only) and a manually segmented subset of the BTEC1 corpus. The experimental results show that the CRF model is able to achieve considerably higher F-scores on the segmentation task than the baseline, even when the baseline is allowed to use words from the test data in its dictionary.

机译：本文审查了条件随机字段（CRF）的有效性，用于识别监督框架内的缅甸字界。现有方法基于最大匹配的方法，这似乎遭受了与缅甸单词组成的方式有关的问题。在我们的实验中，基于使用来自缅甸语言委员会字典（仅限Word）的词典和BTEC1语料库的手动分段子集的最大匹配，将CRF方法与基线进行比较。实验结果表明，即使允许基线在其字典中使用来自测试数据中的单词，CRF模型也能够在分割任务上实现比基线相当高的F分数。

著录项

来源
《International Conference on Genetic and Evolutionary Computing》|2016年|xvii 470p.|共10页
会议地点
作者
Win Pa Pa; Ye Kyaw Thu; Andrew Finch; Eiichiro Sumita;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311-532;
关键词
manner; dictionary; baseline;

机译：方式;字典;基线;

相似文献

外文文献
中文文献
专利

1. Identification of Adopted Pali Words in Myanmar Text [J] . Zin Maung Maung International Journal of Computer Science Issues . 2012,第6期

机译：缅甸文字中采用的巴利语词的识别
2. Conditional random fields for clinical named entity recognition: A comparative study using Korean clinical texts [J] . Lee Wangjin, Kim Kyungmo, Lee Eun Young, Computers in Biology and Medicine . 2018,第期

机译：临床命名实体识别的条件随机字段：韩国临床文本的比较研究
3. Scene text recognition using a Hough forest implicit shape model and semi-Markov conditional random fields [J] . Seok Jae-Hyun, Kim Jin Hyung Pattern Recognition: The Journal of the Pattern Recognition Society . 2015,第11期

机译：使用霍夫森林隐式形状模型和半马尔可夫条件随机场进行场景文本识别
4. Word Boundary Identification for Myanmar Text Using Conditional Random Fields [C] . Win Pa Pa, Ye Kyaw Thu, Andrew Finch, International Conference on Genetic and Evolutionary Computing . 2016

机译：使用条件随机字段的缅甸文本的字边界识别
5. SELECTED TOPICS IN SPATIAL STATISTICAL ANALYSIS: NONSTATIONARY VECTOR KRIGING, LARGE SCALE CONDITIONAL SIMULATION OF THREE-DIMENSIONAL GAUSSIAN RANDOM FIELDS, AND HYPOTHESIS TESTING IN A CORRELATED RANDOM FIELD [D] . QUIMBY, WILLIAM F. 1986

机译：空间统计分析中的选定主题：非平稳向量Kriging，三维高斯随机场的大规模条件模拟以及相关随机场中的假设检验
6. De-identifying Swedish clinical text - refinement of a gold standard and experiments with Conditional random fields [O] . Hercules Dalianis, Sumithra Velupillai 2010

机译：取消识别瑞典临床文本-完善金标准和条件随机场实验
7. A CONDITIONAL RANDOM FIELD APPROACH FOR FACE IDENTIFICATION IN BROADCAST NEWS USING OVERLAID TEXT [O] . Gay Paul, Khoury Elie, Meignier Sylvain, 2015

机译：使用oververtid文本在广播新闻中进行面部识别的条件随机场方法

Word Boundary Identification for Myanmar Text Using Conditional Random Fields

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅