首页> 外国专利> Method for dividing sentences into phrases using entropy calculations of word combinations based on adjacent words

Method for dividing sentences into phrases using entropy calculations of word combinations based on adjacent words

机译:利用基于相邻单词的单词组合的熵计算将句子分为短语的方法

摘要

The process of dividing sentences into phrases is automated. The sentence is divided into sub-sentences using statistical analysis. Then, the sub-sentences are into phrases, using statistical analysis. For example, for each pair of adjacent words in the sentence a metric is calculated which represents a strength of disconnection between the adjacent words. The sentence is divided into sub-sentences at locations in the sentence where the metric exceeds a first threshold.
机译:将句子分为短语的过程是自动的。使用统计分析将句子分为子句。然后,使用统计分析将子句分为短语。例如,对于句子中的每对相邻单词,计算度量,该度量表示相邻单词之间的断开强度。将该句子在度量中超过第一阈值的位置处分成子句。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号