
Chinese Lettered-words Extraction for Language Monitoring


Abstract

Lettered-words are frequently used in Chinese. They fall into two categories: lettered-words with Chinese characters and lettered-words without Chinese characters. The Chinese characters in lettered-words have no special characteristics, so when lettered-words with Chinese characters are scattered through Chinese text, their boundaries are difficult to recognize. As a result, lettered-words with Chinese characters are a difficult case for lettered-word identification and extraction. In this paper, a method that extracts lettered-words with Chinese characters and lettered-words without Chinese characters separately is proposed for the first time. An experiment on language monitoring of lettered-words shows that the proposed method achieves high recall and precision.
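
The abstract does not describe the extraction procedure itself, so the following is only an illustrative sketch of the two categories, not the paper's method. It assumes that letter runs can be found with a regular expression and that lettered-words with Chinese characters are resolved against a small lexicon; the names `MIXED_LEXICON`, `extract_lettered_words`, and `max_context` are hypothetical.

```python
import re

# Runs of ASCII letters/digits, optionally joined by '-' or '.'.
LETTER_RUN = re.compile(r"[A-Za-z0-9]+(?:[-.][A-Za-z0-9]+)*")

# Hypothetical lexicon of lettered-words that include Chinese characters.
MIXED_LEXICON = {"X光", "B超", "卡拉OK", "维生素C", "T恤"}


def extract_lettered_words(text, lexicon=MIXED_LEXICON, max_context=3):
    """Return (pure, mixed): lettered-words without and with Chinese characters."""
    pure, mixed = [], []
    for m in LETTER_RUN.finditer(text):
        run = m.group()
        if not re.search(r"[A-Za-z]", run):
            continue  # a run of digits only is not a lettered-word
        best = None
        # Widen the letter run by up to max_context neighbouring characters
        # on each side and keep the longest window found in the lexicon.
        for left in range(max_context + 1):
            for right in range(max_context + 1):
                if left == 0 and right == 0:
                    continue
                start, end = m.start() - left, m.end() + right
                if start < 0 or end > len(text):
                    continue
                candidate = text[start:end]
                if candidate in lexicon and (best is None or len(candidate) > len(best)):
                    best = candidate
        if best is not None:
            mixed.append(best)
        else:
            pure.append(run)
    return pure, mixed


pure, mixed = extract_lettered_words("他在WTO会议后去医院做了B超,晚上唱卡拉OK。")
print(pure)
print(mixed)
```

A lexicon lookup sidesteps the boundary problem the abstract mentions only for words that are already known; unseen lettered-words with Chinese characters would fall back to the pure category here, which is why a real system needs a stronger boundary model.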
