【24h】

Generating Continuous Representations of Medical Texts

机译:生成医学文本的连续表示

获取原文

摘要

We present an architecture that generates medical texts while learning an informative, continuous representation with discriminative features. During training the input to the system is a dataset of captions for medical X-Rays. The acquired continuous representations are of particular interest for use in many machine learning techniques where the discrete and high-dimensional nature of textual input is an obstacle. We use an Adversarially Regularized Autoencoder to create realistic text in both an unconditional and conditional setting. We show that this technique is applicable to medical texts which often contain syntactic and domain-specific shorthands. A quantitative evaluation shows that we achieve a lower model perplexity than a traditional LSTM generator.
机译:我们提供一种生成医学文本的体系结构,同时学习具有判别功能的信息丰富,连续的表示形式。在训练期间,系统的输入是医学X射线字幕的数据集。对于许多机器学习技术(其中文本输入的离散性和高维性是一个障碍)而言,所获取的连续表示形式特别有意义。我们使用对抗性正则化自动编码器在无条件和有条件的设置中创建逼真的文本。我们表明,该技术适用于通常包含语法和特定领域速记的医学文本。定量评估表明,与传统的LSTM生成器相比,我们实现了更低的模型复杂性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号