首页> 外文会议> >Research of segmentation of Chinese texts in Chinese search engine
【24h】

Research of segmentation of Chinese texts in Chinese search engine

机译:中文搜索引擎中中文文本分割的研究

获取原文

摘要

Segmenting Chinese texts into Chinese words is a very difficult problem. In this paper, a framework for a Chinese Internet search engine is presented. It discusses the characteristics and difficulties of segmentation of Chinese texts in Chinese search engines. The paper concludes that the correctness of Chinese segmentation is most important, and puts forward tactics for processing disambiguation of segmentation strings, new unknown words and stop words, and presents methods which satisfy the consistency of Chinese segmentation.
机译:将中文文本分割成中文单词是一个非常困难的问题。本文提出了中文互联网搜索引擎的框架。讨论了中文搜索引擎中中文文本分割的特点和难点。得出结论,中文分割的正确性是最重要的,并提出了处理分割字符串,新的未知词和停用词的歧义处理策略,并提出了满足中文分割一致性的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号