Error feedback based lexical entity extraction for Chinese language modeling


Abstract

Unlike Western languages, Chinese has no standard definition of a word, so the choice of lexicon plays an important role in Chinese language modeling. This paper proposes a novel method for constructing the lexicon automatically. Rather than depending on statistical measures of text features, the method is driven directly by error feedback from the target task, which in this paper is phoneme-to-grapheme conversion. The process consists of two iterative phases: Phase One selects individual words from a large manually built lexicon, and Phase Two extracts compound words on top of the Phase One result. Experiments on phoneme-to-grapheme conversion show that, compared with baseline lexicons of the same size generated by the conventional word-frequency method, the method achieves absolute character error rate reductions of 1.09% for Phase One and 0.38% for Phase Two.
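
The results above are stated as absolute reductions in character error rate (CER). As a minimal sketch of what that metric measures, the Python below computes CER as character-level edit distance divided by the number of reference characters, then takes the difference between two systems. This is the standard definition, not the paper's own scoring code; the function names and the toy sentences are assumptions made up for illustration and are not data from the paper.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def character_error_rate(refs, hyps) -> float:
    """Total character edits divided by total reference characters."""
    errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    total = sum(len(r) for r in refs)
    return errors / total


def absolute_reduction(baseline_cer: float, new_cer: float) -> float:
    """Absolute (not relative) CER reduction: a plain difference."""
    return baseline_cer - new_cer


# Toy data (hypothetical, for illustration only).
refs = ["今天天气很好", "我们去公园"]           # 11 reference characters
baseline_hyps = ["今天天汽很好", "我门去公园"]  # 2 wrong characters
improved_hyps = ["今天天气很好", "我门去公园"]  # 1 wrong character

baseline_cer = character_error_rate(refs, baseline_hyps)
improved_cer = character_error_rate(refs, improved_hyps)
gain = absolute_reduction(baseline_cer, improved_cer)
```

On this toy data the baseline CER is 2/11 and the improved CER is 1/11, so the absolute reduction is 1/11. An absolute reduction of 1.09% in the abstract is the same kind of difference, expressed in percentage points.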


Bibliographic record

Source
International Congress on Image and Signal Processing | 2013 | pp. 1298-1303 | 6 pages
Authors
Liu Yi; Hua Jing; Li Xiangang; Wu Xihong
Original format
PDF
Keywords
Chinese language modeling; error feedback; lexical entity extraction; lexical entity selection; phoneme-to-grapheme conversion


