首页> 美国卫生研究院文献>BMC Medical Informatics and Decision Making >Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records
【2h】

Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records

机译:在生物医学语料库上预先训练的带有句子嵌入的深度学习可提高在电子病历中查找相似句子的性能

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

An overview of our models. The Random Forest uses manually crafted features (word tokens, character n-grams, sequence similarity, semantic similarity and named entities). The feature selection of the Random Forest was done on the validation set. The Neural Network uses vectors generated by sentence embeddings as inputs. The validation set was used to monitor the early stopping process of the neural network. The ensembled (stacking) model incorporates both the Random Forest and Neural Network models. The validation set was used to train the ensembled model
机译:我们的模型概述。随机森林使用手工制作的功能(单词标记,字符n-gram,序列相似度,语义相似度和命名实体)。随机森林的特征选择是在验证集上完成的。神经网络使用由句子嵌入生成的向量作为输入。验证集用于监视神经网络的早期停止过程。集成(堆叠)模型同时包含随机森林模型和神经网络模型。验证集用于训练集成模型

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号