首页> 外文会议>International Conference on Cloud Computing and Security >A New Dependency Parsing Tree Generation Algorithm Based on the Semantic Dependency Relationship Between Words
【24h】

A New Dependency Parsing Tree Generation Algorithm Based on the Semantic Dependency Relationship Between Words

机译:一种基于单词语义依赖关系的新依赖性解析树生成算法

获取原文

摘要

In this paper it presents a new dependency parsing tree (DPT) generation algorithm. Different from other similar algorithms, which based on statistical probability model, the algorithm converts the dependency parsing tree generation problem into a semantic segments dividing problem. In this paper, the co-occurrence frequency of words is firstly analyzed, and it is pointed out that the co-occurrence frequency of words can be used as the basis for the judgment of semantic dependence relationship between words. Then it further analyzes the change of co-occurrence frequency entropy of words in a semantic unit (sentence is used as the basic semantic unit in this paper). And we present an algorithm to divide a sentence into semantic fragments in which words has tight semantic relationship with each other. Based on the above work, this paper divides the DPT generation algorithm into three steps. The first step is to divide the sentence into semantic fragments. The second step is to distinguish semantic core word and non-semantic core words according to the semantic dependency relationship between words in a semantic fragment. Then in the last step the DPT is generated according semantic dependency relationship between semantic core words. Based on court documents which collected from web, the experiments of our DPT generation algorithm are conducted in this paper. And the results show that the DPT generation algorithm in this paper maintains a high degree of consistency with the DPT tree generated by human.
机译:在本文中它提出了一个新的依存分析树(DPT)生成算法。从其他类似的算法,其基于统计概率模型不同,则算法的依存分析树生成问题转换成语义段划分的问题。在本文中,词的同现频率首先进行分析,指出词语的同现频率可以被用作字之间的语义依存关系的判断的基础。然后进一步分析词语的同现频率的熵在语义单元中的变化(句子被用作基本语义单元在本文中)。我们提出了一种算法,以一个句子分成语义片段中哪些词语具有相互紧密语义关系。基于上述工作,本文划分DPT生成算法分为三个步骤。所述第一步骤是将句子划分成语义片段。第二步骤是根据在语义片段词语之间的语义依赖关系来区分语义核心字和非语义核心词。然后,在最后步骤中DPT根据语义核心词之间的语义依赖关系生成。在此基础上,从网络收集的法庭文件,我们DPT生成算法的实验,在本文中被进行。而结果表明,本文的DPT生成算法保持与人类产生的DPT树高度的一致性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号