Integrating Ngram Model and Case-based LearningFor Chinese Word Segmentation

机译：Ngram模型与案例学习相结合的中文分词

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents our recent workfor participation in the First InternationalChinese Word Segmentation Bakeoff(ICWSB-1). It is based on a generalpurposengram model for word segmentationand a case-based learning approachto disambiguation. This system excelsin identifying in-vocabulary (IV) words,achieving a recall of around 96-98%.Here we present our strategies for languagemodel training and disambiguationrule learning, analyze the system's performance,and discuss areas for further improvement,e.g., out-of-vocabulary (OOV)word discovery.

机译：本文介绍了我们最近的工作参加第一国际中文分词烘烤（ICWSB-1）。它基于通用用于词分割的ngram模型和基于案例的学习方法消除歧义。这个系统擅长在识别词汇中的（IV）单词时，召回率约为96-98％。在这里，我们介绍我们的语言策略模型训练和消歧进行规则学习，分析系统性能，并讨论需要进一步改进的领域，例如，语音提示（OOV）单词发现。

著录项

来源
《41st annual meeting of the Association for Computational Linguistics : Proceedings of the conference》|2003年|1-4|共4页
会议地点
作者
Chunyu Kit; Zhiming Xu; Jonathan J. Webster;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. Integrating Generative and Discriminative Character-Based Models for Chinese Word Segmentation [J] . KUN WANG, CHENGQING ZONG, KEH-YIH SU ACM transactions on Asian language information processing . 2012,第2期

机译：集成基于生成和判别字符的中文分词模型
2. Joint Chinese Word Segmentation and POS Tagging Using an Error-Driven Word-Character Hybrid Model [J] . Canasai KRUKNGKRA, Kiyotaka UCHIMOTO, Junichi KAZAMA, IEICE Transactions on Information and Systems . 2009,第12期

机译：使用错误驱动的字-字符混合模型的联合中文分词和POS标记
3. Learning Chinese Word Segmentation Based on Bidirectional GRU-CRF and CNN Network Model [J] . Chenghai Yu, Shupei Wang, Jiajun Guo International journal of technology and human interaction . 2019,第3期

机译：基于双向GRU-CRF和CNN网络模型的中文分词学习
4. Integrating Ngram Model and Case-based LearningFor Chinese Word Segmentation [C] . Chunyu Kit, Zhiming Xu, Jonathan J. Webster 41st annual meeting of the Association for Computational Linguistics : Proceedings of the conference . 2003

机译：结合Ngram模型和基于案例的学习进行中文分词
5. Word segmentation, word recognition, and word learning: A computational model of first language acquisition. [D] . Daland, Robert. 2009

机译：分词，单词识别和单词学习：母语习得的计算模型。
6. Speculation Detection for Chinese Clinical Notes: Impacts of Word Segmentation and Embedding Models [O] . Shaodian Zhang, Tian Kang, Xingting Zhang, -1

机译：中医临床笔记的推测检测：分词和嵌入模型的影响
7. Integrating Ngram Model and Case-based Learning for Chinese Word Segmentation [O] . Chunyu Kit, Zhiming Xu, Jonathan J. Webster 2008

机译：Ngram模型与案例学习相结合的中文分词

Integrating Ngram Model and Case-based LearningFor Chinese Word Segmentation

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅