首页> 美国卫生研究院文献>Journal of Personalized Medicine >Automatic Labeled Dialogue Generation for Nursing Record Systems
【2h】

Automatic Labeled Dialogue Generation for Nursing Record Systems

机译:用于护理纪录系统的自动标记对话生成

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.
机译:数字语音助理在护理居民的整合变得越来越重要,便于促进与文档的护理生产力。该系统背后的一个关键思想是培训自然语言理解(NLU)模块,使机器能够对用户话语(意图)的目的进行分类,并提取话语中存在的有价值的信息(实体)。创建稳健的主要障碍之一是缺乏足够的标记数据,这通常依赖于人类标记。这个过程是成本密集和耗时的,特别是在高级护理领域,这需要抽象的知识。在本文中,我们提出了一种自动对话标签框架,专门用于护理记录系统。首先,我们应用数据增强技术以创建变体样本话语的集合。个体评估结果强烈地显示了分层率,关于流畅性和话语的准确性。我们还调查为我们增强数据集应用深度生成模型的可能性。基于长短期存储器(LSTM)的基于初步性状的模型获得了90%的精度,并产生各种合理的文本,具有0.76的BLEU分数。其次,我们通过使用特征嵌入和基于语义相似性的群集来介绍意图和实体标记的想法。我们还经验评估了学习最适合与我们的数据和群集任务一起使用的良好表现的不同嵌入方法。实验结果表明,FastText Embeddings分别产生了强烈的表现,可分别为实体标记和实体标记进行强大的性能。分别实现了0.79和0.78 F1分别的精度水平和0.67和0.61轮廓分数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号