Robust Extraction of Named Entity Including Unfamiliar Word

机译：强大地提取名称实体，包括陌生词

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a novel method to extract named entities including unfamiliar words which do not occur or occur few times in a training corpus using a large unannotated corpus. The proposed method consists of two steps. The first step is to assign the most similar and familiar word to each unfamiliar word based on their context vectors calculated from a large unannotated corpus. After that, traditional machine learning approaches are employed as the second step. The experiments of extracting Japanese named entities from IREX corpus and NHK corpus show the effectiveness of the proposed method.

机译：本文提出了一种提取的新方法，用于提取包括不熟悉的单词的命名实体，这些单词不会使用大型未解压语料库在培训语料库中发生或发生在训练语料库中。所提出的方法包括两个步骤。第一步是基于从大型未解析的语料库计算的上下文向量来为每个不熟悉的单词分配最相似和熟悉的单词。之后，使用传统的机器学习方法作为第二步。从IREX语料库和NHK语料库中提取日本命名实体的实验表明了该方法的有效性。

著录项

来源
《Association for Computational Linguistics Annual Meeting: Human Language Technologies》|2008年||共4页
会议地点
作者
Masatoshi Tsuchiya; Shinya Hida; Seiichi Nakagawa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机软件;
关键词
入库时间 2022-08-20 19:48:29

相似文献

外文文献
中文文献
专利

1. Investigating the Combination of Bag of Words and Named Entities Approach in Tracking and Detection Tasks among Journalists [J] . Masnizah Mohd, Omar Mabrook A. Bashaddadh Journal of Information Science Theory and Practice . 2014,第4期

机译：调查新闻工作者中的跟踪和检测任务中的单词袋和命名实体方法的组合
2. Research on Pattern Representation Based on Keyword and Word Embedding in Chinese Entity Relation Extraction [J] . Feiyue Ye, Zhentao Qin Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2018,第4a131期

机译：基于中国实体关系提取中的关键字和单词嵌入的模式表示研究
3. Comparing general and specialized word embeddings for biomedical named entity recognition [J] . Rigo E. Ramos-Vargas, Israel Román-Godínez, Sulema Torres-Ramos PeerJ Computer Science . 2021,第a期

机译：比较生物医学命名实体识别的一般和专用词嵌入
4. Robust Extraction of Named Entity Including Unfamiliar Word [C] . Masatoshi Tsuchiya, Shinya Hida, Seiichi Nakagawa Association for Computational Linguistics Annual Meeting: Human Language Technologies;ACL-08: HLT . 2008

机译：鲁棒地提取包括陌生单词在内的命名实体
5. Learning for information extraction: From named entity recognition and disambiguation to relation extraction. [D] . Bunescu, Razvan Constantin. 2007

机译：学习信息提取：从命名实体识别和歧义消除到关系提取。
6. Medical Named Entity Extraction from Chinese Resident Admit Notes Using Character and Word Attention-Enhanced Neural Network [O] . Yan Gao, Yandong Wang, Patrick Wang, 2020

机译：使用字符和单词注意增强神经网络从中国居民入学笔记中提取医学名称实体
7. Active learning for ontological event extraction incorporating named entity recognition and unknown word handling [O] . 2016

机译：主动学习结合命名实体识别和未知词处理的本体事件提取

Robust Extraction of Named Entity Including Unfamiliar Word

摘要

著录项

相似文献

相关主题

期刊订阅