Conference: Computational Linguistics and Intelligent Text Processing

SIGNUM: A Graph Algorithm for Terminology Extraction


Abstract

Terminology extraction is an essential step in several fields of natural language processing such as dictionary and ontology extraction. In this paper, we present a novel graph-based approach to terminology extraction. We use SIGNUM, a general-purpose graph-based algorithm for binary clustering on directed weighted graphs generated using a metric for multi-word extraction. Our approach is totally knowledge-free and can thus be used on corpora written in any language. Furthermore, it is unsupervised, making it suitable for use by non-experts. Our approach is evaluated on the TREC-9 corpus for filtering against the MeSH and the UMLS vocabularies.
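The abstract does not spell out SIGNUM's update rule, so the following is only an illustrative sketch of one generic way to perform binary clustering on a directed weighted graph: each node repeatedly takes the sign of the weighted sum of its predecessors' labels. It should not be read as the paper's algorithm. The function name `sign_propagation`, the tie-breaking rule, the synchronous update, and the toy graph are all assumptions made for illustration. In a terminology-extraction setting, nodes might stand for candidate multi-word units and the two labels for "term" and "non-term".

```python
from collections import defaultdict


def sign_propagation(edges, initial, max_iters=100):
    """Illustrative binary clustering by sign propagation (hypothetical helper).

    edges:   iterable of (source, target, weight) triples of a directed graph
    initial: dict mapping every node to an initial label, +1 or -1
    """
    # Index the incoming edges of each node once, up front.
    incoming = defaultdict(list)
    for src, dst, weight in edges:
        incoming[dst].append((src, weight))

    labels = dict(initial)
    for _ in range(max_iters):
        updated = {}
        for node, label in labels.items():
            # Weighted vote of the predecessors' current labels.
            score = sum(w * labels[src] for src, w in incoming[node])
            # Assumption: keep the current label on a tie or with no predecessors.
            updated[node] = label if score == 0 else (1 if score > 0 else -1)
        if updated == labels:  # fixed point reached
            break
        labels = updated
    return labels


# Toy graph (made up): two tightly linked pairs and one node "e" that is
# only pointed to by the negative pair.
edges = [
    ("a", "b", 2.0), ("b", "a", 2.0),
    ("c", "d", 1.0), ("d", "c", 1.0),
    ("a", "c", 0.2),
    ("c", "e", 1.0), ("d", "e", 1.0),
]
initial = {"a": 1, "b": 1, "c": -1, "d": -1, "e": 1}
result = sign_propagation(edges, initial)
```

In this toy run, node `e` starts in the positive class but ends in the negative one because both of its predecessors are negative. Synchronous updates of this kind can oscillate on some graphs, which is why the sketch caps the number of iterations.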
