Identification of Multiword Expressions in Technical Domains: Investigating Statistical and Alignment-Based Approaches

机译：识别技术领域的多字大表达：研究统计和基于对准的方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Multiword Expressions (MWEs) are one of the stumbling blocks for more precise Natural Language Processing (NLP) systems. The lack of coverage of MWEs in resources can impact negatively on the performance of tasks and applications, and can lead to loss of information or communication errors; especially in technical domains where MWE are frequent. This paper investigates some approaches to the identification of MWEs in technical corpora based on: association measures, part-of-speech and lexical alignment information. We examine the influence of some factors on their performance such as sources of information for identification and evaluation. While the association measures emphasize recall, the alignment method focuses on precision.

机译：多字型表达式（MWE）是更精确的自然语言处理（NLP）系统的绊脚石之一。资源中的MWE覆盖范围可能会对任务和应用的履行产生负面影响，并可能导致信息或沟通错误丢失;特别是在MWE频繁的技术领域中。本文根据以下：关联措施，言论自由和词汇对齐信息，调查了一些技术集团识别MWE的方法。我们研究一些因素对其绩效的影响，例如鉴定和评估信息来源。虽然关联措施强调召回，但对准方法侧重于精度。

著录项

来源
《Brazilian Symposium in Information and Human Language Technology》|2009年||共9页
会议地点
作者
Villavicencio Aline; Caseli Helena de Medeiros; Machado Andramp; #x0E9;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G20-53;
关键词
Lexical Acquisition; Multiword Expressions; Natural Language Processing;

机译：词汇习得;ullword表达式;自然语言处理;
入库时间 2022-08-20 19:48:55

相似文献

外文文献
中文文献
专利

1. Alignment-based extraction of multiword expressions [J] . Helena de Medeiros Caseli, Carlos Ramisch, Maria das Gracas Volpe Nunes, Computers and the Humanities . 2010,第1a2期

机译：基于对齐的多词表达式提取
2. Identifying bacterial and archaeal homologs of pentameric ligand-gated ion channel (pLGIC) family using domain-based and alignment-based approaches. [J] . Rendon G, Kantorovitz MR, Tilson JL, Channels . 2011,第4期

机译：使用基于域和基于比对的方法鉴定五聚体配体门控离子通道（pLGIC）家族的细菌和古细菌同源物。
3. Identifying bacterial and archaeal homologs of pentameric ligand-gated ion channel (pLGIC) family using domain-based and alignment-based approaches [J] . Rendon Gloria, Kantorovitz Miriam R., Tilson Jeffrey L., Channels . 2011,第4期

机译：使用基于域和基于比对的方法鉴定五聚体配体门控离子通道（pLGIC）家族的细菌和古细菌同源物
4. Identification of Multiword Expressions in Technical Domains: Investigating Statistical and Alignment-Based Approaches [C] . Villavicencio Aline, Caseli Helena de Medeiros, Machado André 2009 Seventh Brazilian Symposium in Information and Human Language Technology . 2010

机译：技术领域中的多词表达识别：调查统计和基于对齐方式的方法
5. The Effects of Using Textual Enhancement on Processing and Learning Multiword Expressions [D] . Alshaikhi, Adel Zain. 2018

机译：使用文本增强对处理和学习多个表达的影响
6. Statistical Approaches for Gene Selection Hub Gene Identification and Module Interaction in Gene Co-Expression Network Analysis: An Application to Aluminum Stress in Soybean (Glycine max L.) [O] . Samarendra Das, Prabina Kumar Meher, Anil Rai, -1

机译：基因共表达网络分析中用于基因选择集线器基因识别和模块相互作用的统计方法：在大豆铝胁迫中的应用
7. Statistically-Driven Alignment-Based Multiword Expression Identification for Technical Domains [O] . Aline Villavicencio, André Machado, Maria José Finatto 2010

机译：基于统计驱动的比对的技术领域多词表达识别

Identification of Multiword Expressions in Technical Domains: Investigating Statistical and Alignment-Based Approaches

摘要

著录项

相似文献

相关主题

期刊订阅