Exploiting Wikipedia priori knowledge for Chinese named entity recognition

机译：利用Wikipedia先验知识进行中文命名实体识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Information Extraction is an important task in Natural Language Processing research. Named Entity Recognition as one of the basic tasks of information extraction, the effect has a great impact on the subsequent tasks such as Relation Extraction. And a major difficulty of NER lies in the unknown word identification. For this issue, method of exploiting Wikipedia external information methods was studied. Wikipedia is a rapid developing online encyclopedia in recent years. In 2016, the number of Chinese entries has reached 860,000. Huge valuable information will be provided to identify unknown words by Exploiting Wikipedia as external knowledge. The Wikipedia entries have been selected, and combined into the Conditional Random Field model of NER as features. The experimental studies demonstrate that this method can improve the effectiveness of NER significantly.

机译：信息提取是自然语言处理研究的重要任务。将实体识别作为信息提取的基本任务之一，其效果对诸如关系提取之类的后续任务有很大的影响。 NER的主要困难在于未知单词的识别。针对此问题，研究了利用Wikipedia外部信息方法的方法。维基百科是近年来发展迅速的在线百科全书。 2016年，中国参赛人数达到86万。通过利用Wikipedia作为外部知识，将提供大量有价值的信息来识别未知单词。已选择Wikipedia条目，并将其合并为NER的条件随机字段模型作为特征。实验研究表明，该方法可以显着提高NER的有效性。

著录项

来源
《2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery》|2016年|1548-1552|共5页
会议地点 Changsha(CN)
作者
Jianfeng Li; Conghui Zhu; Sheng Li; Tiejun Zhao; Dequan Zheng;
展开▼
作者单位

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Encyclopedias; Electronic publishing; Internet; Training; Hidden Markov models; Labeling;

机译：百科全书;电子出版;互联网;培训;隐马尔可夫模型;标签;
入库时间 2022-08-26 14:01:46

相似文献

外文文献
中文文献
专利

1. Automatically building large-scale named entity recognition corpora from Chinese Wikipedia [J] . Jie?Zhou, Bi-cheng?Li, Gang?Chen Frontiers of Information Technology & Electronic Engineering . 2015,第11期

机译：从中文维基百科自动建立大规模的命名实体识别语料库
2. Automatically building large-scale named entity recognition corpora from Chinese Wikipedia [J] . Jie ZHOU, Bi-cheng LI, Gang CHEN 浙江大学学报（英文版）（C辑：计算机与电子） . 2015,第011期

机译：从中文维基百科自动建立大规模的命名实体识别语料库
3. Exploiting Multilingual Wikipedia to Improve Arabic Named Entity Resources [J] . Biltawi Mariam, Awajan Arafat, Tedmori Sara, The international arab journal of information technology . 2017,第4a期

机译：利用多语言维基百科改善阿拉伯命名实体资源
4. Exploiting Wikipedia Priori Knowledge for Chinese Named Entity Recognition [C] . Jianfeng Li, Conghui Zhu, Sheng Li, International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery . 2016

机译：利用Wikipedia先验知识为中文命名实体认可
5. Enabling Entity Retrieval by Exploiting Wikipedia as a Semantic Knowledge Source. [D] . Jeon, Sofia. 2011

机译：通过利用Wikipedia作为语义知识源来启用实体检索。
6. Exploiting and assessing multi-source data for supervised biomedical named entity recognition [O] . Dieter Galea, Ivan Laponogov, Kirill Veselkov -1

机译：开发和评估用于监督生物医学命名实体识别的多源数据
7. Improving Multilingual Named Entity Recognition with Wikipedia Entity Type Mapping [O] . Ni, Jian, Florian, Radu 2017

机译：使用维基百科实体改进多语言命名实体识别类型映射

Exploiting Wikipedia priori knowledge for Chinese named entity recognition

摘要

著录项

相似文献

相关主题

期刊订阅