Automatic filtration of multiword units

Abstract

This paper studies how to filter multiword units. We use normalized expectation (NE) to extract multiword unit candidates from a patent corpus. The candidates are then filtered using stop words, frequency, first stop words, last stop words, and contextual entropy. The experimental results show that the precision of the extracted multiword units improves by 8.7% after filtering.
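
The filtering stage described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no stop word list, thresholds, or entropy formula, so `STOP_WORDS`, `MIN_FREQ`, and `MIN_ENTROPY` are hypothetical values, and contextual entropy is assumed here to be the Shannon entropy of the words immediately to the left and right of a candidate. Candidates are assumed to be tuples of tokens.

```python
import math
from collections import Counter

# Hypothetical stop word list and thresholds; the abstract does not specify them.
STOP_WORDS = {"the", "of", "a", "an", "in", "to", "and", "is"}
MIN_FREQ = 2
MIN_ENTROPY = 0.5


def entropy(counts):
    """Shannon entropy (in bits) of a Counter of neighbor words."""
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def context_counts(tokens, candidate):
    """Return the candidate's frequency and its left/right neighbor counts."""
    n = len(candidate)
    left, right = Counter(), Counter()
    freq = 0
    for i in range(len(tokens) - n + 1):
        if tuple(tokens[i:i + n]) == candidate:
            freq += 1
            # Corpus boundaries are treated as pseudo-words (an assumption).
            left[tokens[i - 1] if i > 0 else "<s>"] += 1
            right[tokens[i + n] if i + n < len(tokens) else "</s>"] += 1
    return freq, left, right


def keep_candidate(tokens, candidate):
    """Return True if the candidate passes all illustrative filters."""
    # First/last stop word filter: reject candidates that begin or end with one.
    if candidate[0] in STOP_WORDS or candidate[-1] in STOP_WORDS:
        return False
    freq, left, right = context_counts(tokens, candidate)
    # Frequency filter: reject rare candidates.
    if freq < MIN_FREQ:
        return False
    # Contextual entropy filter: a unit with a fixed neighbor on either side
    # is likely a fragment of a longer unit.
    return min(entropy(left), entropy(right)) >= MIN_ENTROPY
```

In a toy corpus where "memory cell" appears after "the", "a", and "each", and before "stores", "is", and "has", both neighbor distributions are varied and the candidate is kept. The single word "memory", always followed by "cell", has zero right entropy and is rejected.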

Bibliographic details

Source
International Conference on Natural Language Processing and Knowledge Engineering | 2010 | 4 pages
Authors
Ying Liu; Zheng Tie
Original format: PDF
Chinese Library Classification: TP312-53
Keywords
contextual entropy; extract; filtrate; multiword unit

Similar literature

1. A Method for Automatic Extraction of Multiword Units Representing Business Aspects From User Reviews [J]. Olga Vechtomova. Journal of the American Society for Information Science and Technology. 2014, No. 7
2. Semi-automatic extraction of multiword terms from domain-specific corpora [J]. Vesna Pajic, Stasa Vujicic Stankovic, Ranka Stankovic. The Electronic Library. 2018, No. 3
3. Automatically learning semantic knowledge about multiword predicates [J]. Afsaneh Fazly, Suzanne Stevenson, Ryan North. Language Resources and Evaluation. 2007, No. 1
4. Automatic extraction and filtration of multiword units [C]. Liu Ying, Tie Zheng. 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery. 2011
5. An analysis of the processing of multiword units in sentence reading and unit presentation using eye movement data: Implications for theories of MWUs. [D]. Columbus, Georgina. 2012
6. Individual Chunking Ability Predicts Efficient or Shallow L2 Processing: Eye-Tracking Evidence From Multiword Units in Relative Clauses [O]. Manuel F. Pulido. 2020
7. A Multiword Unit Analysis: COCA Multiword Unit List 20 and ColloGram [O]. Dongkwang Shin, Yuah V. Chon. 2019
