首页> 外文会议>International Conference on Computational Linguistics >Chinese Term Extraction Using Minimal Resources
【24h】

Chinese Term Extraction Using Minimal Resources

机译:使用最小资源的中文术语提取

获取原文

摘要

This paper presents a new approach for term extraction using minimal resources. A term candidate extraction algorithm is proposed to identify features of the relatively stable and domain independent term delimiters rather than that of the terms. For term verification, a link analysis based method is proposed to calculate the relevance between term candidates and the sentences in the domain specific corpus from which the candidates are extracted. The proposed approach requires no prior domain knowledge, no general corpora, no full segmentation and minimal adaptation for new domains. Consequently, the method can be used in any domain corpus and it is especially useful for resource-limited domains. Evaluations conducted on two different domains for Chinese term extraction show quite significant improvements over existing techniques and also verify the efficiency and relative domain independent nature of the approach. Experiments on new term extraction also indicate that the approach is quite effective for identifying new terms in a domain making it useful for domain knowledge update.
机译:本文介绍了使用最小资源提取的新方法。提出了术语候选提取算法,以识别相对稳定和域独立术语分隔符的特征,而不是术语。对于术语验证,提出了一种基于链路分析的方法来计算候选人与提取候选者的特定语料库中的术语候选和句子之间的相关性。拟议的方法不需要现有的域名知识,没有一般性公司,没有全面分割和新域的最小适应。因此,该方法可以用于任何域语料库,并且对于资源限制域特别有用。对中国术语提取的两个不同域进行的评估表现出对现有技术的显着改进,并验证了这种方法的效率和相关领域的独立性质。新术语提取的实验还表明该方法非常有效地识别域中的新术语,使其有用于域知识更新。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号