首页> 外文会议>2009 International Conference on Signal Processing Systems >Feature-Based Approach to Chinese Term Relation Extraction
【24h】

Feature-Based Approach to Chinese Term Relation Extraction

机译:基于特征的汉语术语关系抽取方法

获取原文

摘要

In this paper, we propose a feature-based Chinese term relation extraction approach that combined the advantages of both naive bayes algorithm and perceptron algorithm. A subset of the features was estimated in training data; another subset of the features was trained by discriminative function. The results demonstrate that the proposed hybrid algorithm almost always outperforms the naive bayes algorithms and perceptron algorithms whether the training set is small or not. On the other hand, a novel feature representation was proposed, which included term sequence feature, term appearance features and context information features. Comparing the previous method, long-range dependence was considered in the proposed feature representation, which add the position of feature into vector space model (VSM) and promotes the capability of feature representation. Further, punctuation feature is the important character for terms relation extraction.
机译:在本文中,我们提出了一种基于特征的中文术语关系提取方法,该方法结合了朴素贝叶斯算法和感知器算法的优点。在训练数据中估计了部分功能;特征的另一子集通过判别函数进行训练。结果表明,无论训练集是否小,提出的混合算法几乎总是优于朴素贝叶斯算法和感知器算法。另一方面,提出了一种新颖的特征表示方法,包括术语序列特征,术语外观特征和上下文信息特征。与以前的方法相比,在提出的特征表示中考虑了远程依赖性,这将特征的位置添加到向量空间模型(VSM)中,并提高了特征表示的能力。此外,标点符号特征是术语关系提取的重要特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号