首页> 外国专利> AUTOMATIC WORD SPACING METHOD OF KOREAN USING SYLLABLE UNIT CONDITION PROBABILITY

AUTOMATIC WORD SPACING METHOD OF KOREAN USING SYLLABLE UNIT CONDITION PROBABILITY

机译:音节单位概率的韩文自动分词方法。

摘要

PURPOSE: An automatic word spacing method of Korean using syllable unit condition probability is provided to process a word spacing method with respect to a sentence prepared based on partial spacing words and a sentence having no space by using a statistical method instead of a vocabulary knowledge or a heuristic. CONSTITUTION: A hypothesis for a spacing words optimum pattern search is set(400). The maximum accumulated log probability is calculated based on the set hypothesis(402). An output string is obtained by searching a spacing words optimum pattern of a syllable inputted using the maximum accumulated log probability and the back pointer(404). In the hypothesis process, a space is generated when a transient is generated as the same state, and a syllable is generated when a transient is generated as a different state. One hypothesis has the latest "n-1" number syllable, an accumulated log probability, and a back pointer. The back pointer is used for sensing the previous hypothesis extracting the current hypothesis, and stores a time, a status and a pointer of the previous hypothesis.
机译:目的:提供一种使用音节单位条件概率的韩文自动单词间隔方法,以处理基于部分间隔单词的句子和没有空格的句子,通过使用统计方法代替词汇知识或使用统计方法来处理单词间隔方法启发式的构成:一个关于间隔词最佳模式搜索的假设被设定(400)。基于设定的假设来计算最大累积对数概率(402)。通过搜索使用最大累积对数概率和后向指针(404)输入的音节的间隔词最佳模式来获得输出字符串。在假设过程中,当瞬态被生成为相同状态时,将生成一个空格;当瞬态被生成为不同状态时,将生成一个音节。一个假设具有最新的“ n-1”个音节,一个累积对数概率和一个后向指针。后指针用于感测先前的假设以提取当前假设,并存储先前的假设的时间,状态和指针。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号