Conference: International Symposium on Computational Intelligence and Design

Word Segmentation Method Based on Inductive Learning and Segmentation Rule


Abstract

A word segmentation method based on inductive learning for non-segmented languages uses only the surface information of a character string, so it has the advantage of being entirely independent of any specific language. The method recursively extracts character strings that occur frequently in a text as word candidates, and extracts segmentation rules with context information to deal with segmentation ambiguity. It classifies the extracted word candidates into different ranks according to the circumstances of their extraction, and segments a text into words using those candidates. Through a proofreading process, erroneous segmentations are corrected, and the ranking of word candidates and the segmentation rules are updated. Evaluation experiments showed that the method is applicable to Japanese and Chinese word segmentation.
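The abstract does not give the algorithm's details, so the following is only a rough sketch of the general idea of taking frequently occurring character strings as word candidates and segmenting with them. It swaps in plain n-gram frequency counting and greedy longest-match segmentation in place of the paper's recursive extraction, candidate ranking, and context-based segmentation rules. The function names `extract_candidates` and `segment` and the parameters `min_count` and `max_len` are hypothetical and do not come from the paper.

```python
from collections import Counter


def extract_candidates(text, min_count=2, max_len=4):
    """Return substrings of length 2..max_len that occur at least min_count times.

    Assumption: raw frequency stands in for the paper's recursive extraction.
    """
    counts = Counter(
        text[i:i + n]
        for n in range(2, max_len + 1)
        for i in range(len(text) - n + 1)
    )
    return {s for s, c in counts.items() if c >= min_count}


def segment(text, candidates):
    """Greedy longest-match segmentation using the candidate set.

    Characters not covered by any candidate become single-character tokens.
    """
    max_len = max((len(c) for c in candidates), default=1)
    words, i = [], 0
    while i < len(text):
        # Try the longest possible piece first, falling back to one character.
        for n in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + n]
            if n == 1 or piece in candidates:
                words.append(piece)
                i += n
                break
    return words


if __name__ == "__main__":
    sample = "abcxabcyabc"
    cands = extract_candidates(sample, min_count=2, max_len=3)
    print(segment(sample, cands))
```

Because the sketch relies only on character-level counts, it shares the language-independence property the abstract highlights, but it makes no attempt to resolve ambiguity the way the described segmentation rules do.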
