Named entity recognition for Chinese telecommunications field based on Char2Vec and Bi-LSTMs

Abstract

Named Entity Recognition (NER) is a basic task in Natural Language Processing (NLP) that extracts meaningful named entities from text. Chinese NER is more challenging than English NER because the Chinese language has no tense. Moreover, omissions and Internet catchwords in Chinese corpora make the task more difficult. Traditional machine learning methods (e.g., CRFs) cannot address Chinese NER effectively because they struggle to learn the complicated context of the Chinese language. To overcome this problem, we propose a deep learning model, Char2Vec+Bi-LSTMs, for Chinese NER. We use the Chinese character rather than the Chinese word as the embedding unit, and use Bi-LSTMs to learn the complicated semantic dependencies. To evaluate the proposed model, we construct a corpus from China TELECOM FAQs. Experimental results show that our model outperforms the baseline methods and that character embedding is more appropriate than word embedding for the Chinese language.
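
The abstract gives no implementation details. As a minimal sketch of what using the character as the embedding unit could mean for data preparation, the Python below labels each character of a sentence with a BIO tag and maps characters to integer ids that an embedding layer followed by a Bi-LSTM might consume. The BIO scheme, the `ORG` entity type, the example sentence, and the helper names `char_bio_tags` and `build_char_vocab` are assumptions made for illustration; none of them is taken from the paper.

```python
from typing import Dict, List, Tuple


def char_bio_tags(text: str, entities: List[Tuple[int, int, str]]) -> List[Tuple[str, str]]:
    """Pair every character of `text` with a BIO tag.

    `entities` holds (start, end, type) spans, with `end` exclusive.
    The BIO scheme is an assumption; the abstract does not name a tagging scheme.
    """
    tags = ["O"] * len(text)
    for start, end, ent_type in entities:
        tags[start] = f"B-{ent_type}"      # first character of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{ent_type}"      # remaining characters of the entity
    return list(zip(text, tags))


def build_char_vocab(sentences: List[str]) -> Dict[str, int]:
    """Map each distinct character to an integer id (0 = padding, 1 = unknown)."""
    vocab = {"<pad>": 0, "<unk>": 1}
    for sentence in sentences:
        for ch in sentence:
            vocab.setdefault(ch, len(vocab))
    return vocab


# Hypothetical example sentence ("China Telecom broadband"); not from the paper's corpus.
sentence = "中国电信宽带"
spans = [(0, 4, "ORG")]  # hypothetical entity span covering "中国电信"

tagged = char_bio_tags(sentence, spans)
vocab = build_char_vocab([sentence])
char_ids = [vocab.get(ch, vocab["<unk>"]) for ch in sentence]

for ch, tag in tagged:
    print(ch, tag)
print(char_ids)
```

Working at the character level avoids a separate word segmentation step, so segmentation errors cannot propagate into the tagger; each position in the input sequence is simply one character.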

Bibliographic details

Source
International Conference on Intelligent Systems and Knowledge Engineering, 2017, pp. 1-7 (7 pages)
Authors
Yu Wang; Bin Xia; Zheng Liu; Yun Li; Tao Li
Original format: PDF
Keywords
Hidden Markov models; Logic gates; Computer architecture; Machine learning; Semantics; Telecommunications
