Recognition Method of Important Words in Korean Text based on Reinforcement Learning

机译：基于强化学习的韩文文本重要词汇识别方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The manual labeling work for constructing the Korean corpus is too time-consuming and laborious. It is difficult for low-minority languages to integrate resources. As a result, the research progress of Korean language information processing is slow. From the perspective of representation learning, reinforcement learning was combined with traditional deep learning methods. Based on the Korean text classification effect as a benchmark, and studied how to extract important Korean words in sentences. A structured model Information Distilled of Korean (IDK) was proposed. The model recognizes the words in Korean sentences and retains important words and deletes non-important words. Thereby transforming the reconstruction of the sentence into a sequential decision problem. So you can introduce the Policy Gradient method in reinforcement learning to solve the conversion problem. The results show that the model can identify the important words in Korean instead of manual annotation for representation learning. Furthermore, compared with traditional text classification methods, the model also improves the effect of Korean text classification.

机译：制造韩国语料库的手动标签工作太耗时和费力。低少数民族语言难以整合资源。因此，韩语信息处理的研究进展缓慢。从代表学习的角度来看，加固学习与传统的深度学习方法相结合。基于韩国文本分类效果作为基准，并研究了如何提取句子中的重要韩语单词。提出了蒸馏韩国（IDK）的结构化模型信息。该模型识别韩语句子中的单词并保留重要的单词并删除非重要词语。从而将句子的重建转变为序贯决策问题。因此，您可以在加强学习中介绍策略渐变方法来解决转换问题。结果表明，该模型可以识别韩语中的重要词语而不是用于表示学习的手动注释。此外，与传统文本分类方法相比，该模型还提高了韩文文本分类的影响。

著录项

来源
《Chinese National Conference on Computational Linguistic》|2020年|1017-1025|共9页
会议地点
作者
Feiyang Yang; Yahui Zhao; Rongyi Cui;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Word embedding and text classification based on deep learning methods [J] . Saihan Li, Bing Gong MATEC Web of Conferences . 2021,第a期

机译：基于深度学习方法的单词嵌入和文本分类
2. A Chinese unknown word recognition method for micro-blog short text based on improved FP-growth [J] . Pattern Analysis and Applications . 2020,第2期

机译：基于改进的FP增长的微博短文本中文未知词识别方法
3. A Novel Scene Text Recognition Method Based on Deep Learning [J] . Maosen Wang, Shaozhang Niu, Zhenguang Gao Computers, Materials & Continua . 2019,第2期

机译：基于深度学习的新颖文本识别方法
4. Recognition Method of Important Words in Korean Text Based on Reinforcement Learning [C] . Feiyang Yang, Yahui Zhao, Rongyi Cui China National Conference on Computational Linguistics . 2020

机译：基于强化学习的韩文文本重要词汇识别方法
5. Training a Neural Network to Construct Sentences from an Inputted Word List: A Comparison Between Supervised and Reinforcement Learning Methods [D] . Black, Samuel 2018

机译：训练神经网络以从输入的单词列表构建句子：监督学习和强化学习方法之间的比较
6. A Study of Active Learning Methods for Named Entity Recognition in Clinical Text [O] . Yukun Chen, Thomas A. Lasko, Qiaozhu Mei, -1

机译：主动学习方法在临床文本中识别实体的研究
7. Learning the lexicon from raw texts for open-vocabulary Korean word recognition [O] . Sungho Ryu, Jin Hyung Kim 2010

机译：从原始文本中学习词汇，以进行开放式韩语单词识别

Recognition Method of Important Words in Korean Text based on Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅