首页> 外文期刊>Mobile information systems >KoRASA: Pipeline Optimization for Open-Source Korean Natural Language Understanding Framework Based on Deep Learning
【24h】

KoRASA: Pipeline Optimization for Open-Source Korean Natural Language Understanding Framework Based on Deep Learning

机译:Korasa:基于深度学习的开源韩国自然语言理解框架的管道优化

获取原文
       

摘要

Since the emergence of deep learning-based chatbots for knowledge services, numerous research and development projects have been conducted in various industries. A high demand for chatbots has drastically increased the global market size; however, the limited functional scalability of open-domain chatbots is a challenge to their application to industries. Moreover, as most chatbot frameworks employ English, it is necessary to create chatbots customized for other languages. To address this problem, this paper proposes KoRASA as a pipeline-optimization method, which uses a deep learning-based open-source chatbot framework to understand the Korean language. KoRASA is a closed-domain chatbot that is applicable across a wide range of industries in Korea. KoRASA’s operation consists of four stages: tokenization, featurization, intent classification, and entity extraction. The accuracy and F 1-score of KoRASA were measured based on datasets taken from common tasks carried out in most industrial fields. The algorithm for intent classification and entity extraction was optimized. The accuracy and F 1-score were 98.2% and 98.4% for intent classification and 97.4% and 94.7% for entity extraction, respectively. Furthermore, these results are better than those achieved by existing models. Accordingly, KoRASA can be applied to various industries, including mobile services based on closed-domain chatbots using Korean, robotic process automation (RPA), edge computing, and Internet of Energy (IoE) services.
机译:自从基于深度学习的聊天聊天的信息以来,在各个行业都进行了众多研发项目。对Chatbots的高需求急剧增加了全球市场规模;然而,开放式域聊天的有限功能可扩展性是对其在行业的应用中的挑战。此外,由于大多数Chatbot框架使用英语,必须创建为其他语言自定义的Chatbots。为了解决这个问题,本文提出了Korasa作为一种管道优化方法,它使用基于深度学习的开源Chatbot框架来了解韩语。 Korasa是一个封闭式聊天聊天,适用于韩国各种行业。 Korasa的操作包括四个阶段:令牌化,特色,意图分类和实体提取。基于从大多数工业领域的共同任务所采取的数据集来测量Korasa的准确性和F 1分数。针对意图分类和实体提取的算法进行了优化。对于INTING分类分类分别为98.2%和98.4%,分别为97.4%和94.7%,分别为97.4%和94.7%。此外,这些结果优于现有模型实现的结果。因此,Korasa可以应用于各种行业,包括使用韩国,机器人过程自动化(RPA),边缘计算和能量互联网(IOE)服务的闭合域聊天的移动服务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号