CONSTRUCTING MARKOV MODELS OF WORDS FROM MULTIPLE UTTERANCES

Abstract

The present invention addresses the problem of constructing fenemic baseforms which take into account variations in pronunciation of words from one utterance thereof to another. Specifically, the invention relates to a method of constructing a fenemic baseform for a word in a vocabulary of word segments including the steps of: (a) transforming multiple utterances of the word into respective strings of fenemes; (b) defining a set of fenemic Markov model phone machines; (c) determining the best single phone machine P1 for producing the multiple feneme strings; (d) determining the best two phone baseform of the form P1P2 or P2P1 for producing the multiple feneme strings; (e) aligning the best two phone baseform against each feneme string; (f) splitting each feneme string into a left portion and a right portion with the left portion corresponding to the first phone machine of the two phone baseform and the right portion corresponding to the second phone machine of the two phone baseform; (g) identifying each left portion as a left substring and each right portion as a right substring; (h) processing the set of left substrings and the set of right substrings in the same manner as the set of feneme strings corresponding to the multiple utterances including the further step of inhibiting further splitting of a substring when the single phone baseform thereof has a higher probability of producing the substring than does the best two phone baseform; and (k) concatenating the unsplit single phones in an order corresponding to the order of the feneme substrings to which they correspond.
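
The divide-and-conquer procedure in steps (c) through (k) can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify the phone machines, so the sketch assumes a toy model in which each phone is named after one feneme, emits that feneme with probability `P_MATCH`, and follows a simple geometric length distribution. The constants `P_EMPTY`, `P_STAY`, `P_MATCH` and all function names are hypothetical, and the alignment in step (e) is approximated by picking the single best split point per string.

```python
import math

# Assumed toy parameters (not from the patent).
P_EMPTY = 0.1  # probability that a phone produces no feneme at all
P_STAY = 0.5   # probability of emitting one more feneme (self-loop)
P_MATCH = 0.8  # probability that a phone emits its own feneme


def log_prob(phone, string, alphabet):
    """Log-probability that one toy phone machine produces `string`."""
    if not string:
        return math.log(P_EMPTY)
    lp = math.log(1 - P_EMPTY) + math.log(1 - P_STAY)
    lp += (len(string) - 1) * math.log(P_STAY)
    p_other = (1 - P_MATCH) / (len(alphabet) - 1)
    for feneme in string:
        lp += math.log(P_MATCH if feneme == phone else p_other)
    return lp


def best_single(strings, alphabet):
    """Step (c): (score, phone) maximizing the joint log-probability."""
    return max((sum(log_prob(p, s, alphabet) for s in strings), p)
               for p in alphabet)


def best_split(first, second, string, alphabet):
    """Step (e): best (score, cut) when `first` then `second` produce `string`."""
    return max((log_prob(first, string[:k], alphabet)
                + log_prob(second, string[k:], alphabet), k)
               for k in range(len(string) + 1))


def best_pair(strings, p1, alphabet):
    """Step (d): best two-phone baseform of the form P1P2 or P2P1."""
    best = None
    for p2 in alphabet:
        for pair in ((p1, p2), (p2, p1)):
            aligned = [best_split(pair[0], pair[1], s, alphabet) for s in strings]
            score = sum(a[0] for a in aligned)
            if best is None or score > best[0]:
                best = (score, pair, [a[1] for a in aligned])
    return best


def build_baseform(strings, alphabet):
    """Steps (c)-(k): split recursively, then concatenate the unsplit phones."""
    single_score, p1 = best_single(strings, alphabet)
    pair_score, _pair, cuts = best_pair(strings, p1, alphabet)
    if single_score >= pair_score:  # step (h): inhibit further splitting
        return [p1]
    left = [s[:k] for s, k in zip(strings, cuts)]   # steps (f), (g)
    right = [s[k:] for s, k in zip(strings, cuts)]
    return build_baseform(left, alphabet) + build_baseform(right, alphabet)


if __name__ == "__main__":
    # Three utterances of one word, already converted to feneme strings (step (a)).
    print(build_baseform(["AAABBB", "AABBBB", "AAAABB"], "ABC"))
```

Under these assumed parameters, the three example strings yield the two-phone baseform `['A', 'B']`: the first split separates the A-run from the B-run in every utterance, and each half is then better explained by a single phone than by any pair, so recursion stops. The alphabet needs at least two symbols for the toy emission model to be defined.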