Journal: Journal of Information Science and Engineering

Extracting Chinese Frequent Strings Without a Dictionary From a Chinese Corpus and its Applications



Abstract

This paper describes how to extract Chinese frequent strings without using a dictionary. We generalize the notions of words and unknown words to that of frequent strings. The Chinese frequent strings (CFSs) we define include words, unknown words, and other frequently used strings. Some examples of CFSs are "(can only let)", "(every minute and every second)", "(bearing in mind the interest of each other)", and "(and nobody)". CFSs are very useful in Chinese natural language processing and its related applications.
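The abstract does not spell out the extraction procedure, so the following is only a rough sketch of what dictionary-free extraction can look like, not the paper's method. It assumes the simplest possible approach: count every character n-gram in the corpus and keep those that reach a frequency threshold. The function name `frequent_strings`, its parameters, and the three toy sentences are all hypothetical and chosen for illustration.

```python
from collections import Counter


def frequent_strings(corpus, min_len=2, max_len=4, min_count=2):
    """Return character n-grams that occur at least min_count times.

    Illustrative assumption only: plain n-gram counting with a threshold.
    No dictionary or word segmenter is consulted at any point.
    """
    counts = Counter()
    for sentence in corpus:
        for n in range(min_len, max_len + 1):
            # Slide a window of n characters across the sentence.
            for i in range(len(sentence) - n + 1):
                counts[sentence[i:i + n]] += 1
    return {s: c for s, c in counts.items() if c >= min_count}


# Toy corpus invented for this sketch (not data from the paper).
corpus = ["我們只能讓他去", "我們只能讓步", "他們只能讓他走"]
result = frequent_strings(corpus)

# Print the surviving strings, most frequent first.
for string, count in sorted(result.items(), key=lambda kv: (-kv[1], kv[0])):
    print(string, count)
```

A raw count like this also keeps substrings of longer frequent strings, so a practical system would need some further filtering step. The abstract does not say how the paper handles that.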
