A combined method for automatic domain-specific Terminology extraction

机译：一种自动提取特定领域术语的组合方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present a Terminology extraction algorithm combining with machine learning and corpus-based statistical model. We collect a balanced corpus with all the possible nominal terms of every domain annotated, and take this corpus as training corpus. After selecting training features for terms, we use SVM to recognize terminological candidates in target corpus. Then we calculate the Domain Relevance (DR) and Domain Consensus (DC) scores for the terminological candidates to acquire domain-specific Terminologies. We make 4 experiments on Tourism corpus and short sentences with two kinds of balanced training corpora. Furthermore, we evaluate the precision and recall of our Terminology extraction algorithm by comparing the words in a golden standard with the words extracted by our system. The experiments show that our algorithm can get improved result in automatic extraction of nominal domain-specific Terminologies. A detailed analysis shows the advantages and disadvantages of our algorithm.

机译：在本文中，我们提出了一种结合了机器学习和基于语料库的统计模型的术语提取算法。我们收集平衡的语料库，并在每个域中标注所有可能的名义条款，并将该语料库作为训练语料库。为术语选择训练功能后，我们使用SVM识别目标语料库中的术语候选者。然后，我们为候选术语计算域相关性（DR）和域共识（DC）分数，以获取特定于域的术语。我们用两种平衡训练语料库对旅游语料库和短句进行了4个实验。此外，我们通过将黄金标准中的单词与系统提取的单词进行比较，来评估术语提取算法的准确性和召回率。实验表明，我们的算法在自动提取标称领域专有术语方面可以获得改进的结果。详细分析显示了我们算法的优缺点。

著录项

来源
《2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery》|2011年|p.1734-1737|共4页
会议地点
作者
Liu Li; Qi Quan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
GATE; SVM; Terminology; domain consensus; domain relevance;

机译：GATE; SVM;术语;领域共识;领域相关性;

相似文献

外文文献
中文文献
专利

1. Semi-automatic extraction of multiword terms from domain-specific corpora [J] . Vesna Pajic, Stasa Vujicic Stankovic, Ranka Stankovic, The Electronic Library . 2018,第3期

机译：从特定领域语料库中半自动提取多词术语
2. DEXTER: Automatic Extraction of Domain-Specific Glossaries for Language Teaching [J] . Carlos Peri?án-Pascual, Eva M. Mestre-Mestre Procedia - Social and Behavioral Sciences . 2015,第2期

机译：DEXTER：自动提取用于语言教学的特定领域词汇表
3. Methods for automatic term recognition in domain-specific text collections: A survey [J] . Astrakhantsev N. A., Fedorenko D. G., Turdakov D. Yu. Programming and Computer Software . 2015,第6期

机译：特定领域文本集中的自动术语识别方法：一项调查
4. A combined method for automatic domain-specific Terminology extraction [C] . Liu Li, Qi Quan International Conference on Fuzzy Systems and Knowledge Discovery . 2011

机译：一种自动域特定术语提取的组合方法
5. Knowledge-based methods for automatic extraction of domain-specific ontologies. [D] . Punuru, Janardhana R. 2007

机译：用于自动提取特定领域本体的基于知识的方法。
6. Supporting the use of standardized nursing terminologies with automatic subject heading prediction: a comparison of sentence-level text classification methods [O] . Hans Moen, Kai Hakala, Laura-Maria Peltonen, 2020

机译：支持标准护理术语与自动主题预测的使用：句子级文本分类方法的比较
7. Automatic Access to Legal Terminology Applying Two Different Automatic Term Recognition Methods [O] . Pérez María José Marín, Rizzo Camino Rea 2013

机译：使用两种不同的自动术语识别方法自动访问法律术语
8. Semi-Automated Methods for Refining a Domain-Specific Terminology Base [R] . Rose, G., Holland, M., Larocca, S., 2011

机译：细化特定领域术语库的半自动化方法

A combined method for automatic domain-specific Terminology extraction

摘要

著录项

相似文献

相关主题

期刊订阅