首页> 外文会议>2011 Sixth Annual ChinaGrid Conference >Discovering Chinese Compound Term Using Termhood and Unithood Measures
【24h】

Discovering Chinese Compound Term Using Termhood and Unithood Measures

机译:使用术语和单位度量法发现中文复合术语

获取原文

摘要

Domain terms play a crucial role in many research areas, which has led to a rise in demand for automatic domain terms extraction. In this paper, we present a two-level evaluation approach based on term hood and unit hood to extract Chinese domain compound terms automatically, which takes the character-level and word-level information into account. To achieve this, we incorporate semantic features by using the word segmentation to recognize single word terms, then leverage the improved C-value and heuristic methods such as word formation pattern and word formation power to evaluate candidates at both levels. By validating our approach with several existing dictionaries, a significant improvement of compound terms detection is achieved. Experiments in legal corpus show our method is superior over other compared methods.
机译:领域术语在许多研究领域中起着至关重要的作用,这导致对自动领域术语提取的需求增加。在本文中,我们提出了一种基于术语罩和单元罩的两级评估方法来自动提取中文域复合词,其中考虑了字符级和词级信息。为实现此目的,我们通过使用分词来识别单个词项,并结合了语义特征,然后利用改进的C值和启发式方法(例如构词模式和构词能力)来评估两个级别的候选者。通过使用几种现有词典验证我们的方法,可以显着改善复合词检测。法律语料库的实验表明,我们的方法优于其他比较方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号