首页> 外文会议>7th International Conference on Natural Language Processing and Knowledge Engineering >Chinese sentence compression based on statistics probability and dependency analysis
【24h】

Chinese sentence compression based on statistics probability and dependency analysis

机译:基于统计概率和依存关系分析的中文句子压缩

获取原文
获取原文并翻译 | 示例

摘要

In this paper we describe a Chinese sentence compression tool, which makes use of various technologies. We first describe how to prepare our own Chinese training corpus and from what we learn the removal and non-removal probabilities. Then we bring in a knowledge base to preprocess the input sentences. Following that, how an input sentence gets analyzed is described, using Chinese word segmentation, a pos-tagging removal program and a shallow parser which outputs the collapsed dependencies. Next, we use the removal probabilities and compression rules to finish the task. Experimental results have shown that this method is feasible.
机译:在本文中,我们描述了一种利用各种技术的中文句子压缩工具。我们首先描述如何准备自己的中文培训语料库,以及从中学到的撤除和非撤除概率。然后,我们引入一个知识库来预处理输入的句子。然后,使用中文分词,pos-tags去除程序和输出折叠后的依存关系的浅层解析器,描述了如何分析输入句子。接下来,我们使用删除概率和压缩规则来完成任务。实验结果表明该方法是可行的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号