首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine >Incorporating Syntactic Dependencies into Semantic Word Vector Model for Medical Text Processing
【24h】

Incorporating Syntactic Dependencies into Semantic Word Vector Model for Medical Text Processing

机译:将句法依存关系纳入用于医学文本处理的语义词向量模型

获取原文

摘要

Natural language processing has been split into two sub-fields: semantic and syntactic analysis. Semantic processing deals with creating vector representations for each word. The goal is for the vectors to be dense and that relationships between words are made clear by their vectors. Word2Vec Skip-Gram model is one such semantic word embedding model that processes the words into vectors based on the co-occurrences of the words within a fixed context window. Syntactic processing, on the other hand, deals with gleaning textual information from the structure of the text. A popular method of syntactic processing is dependency parsing, where connections are explicitly drawn between different pairs of words in a phrase or clause, where one word modifies the other. Human beings make use of both semantic and syntactic information when processing text. As such, combining semantic and syntactic processing is important for medical text analysis. The dependencies between words define specific medical concepts, such as diseases and symptoms. The aim of this research paper is to investigate an altered word vector model-the SD Skip-Gram model which incorporates the dependencies between words. The SD Skip-Gram model is compared against the basic Skip-Gram model for analyzing common words and medical disease and symptom concepts. The returned results demonstrate that the SD Skip-Gram model can identify the relationships between the common words as well as the basic Skip-Gram model and perform better than the basic Skip-Gram model in identifying the related disease and symptom concepts.
机译:自然语言处理已分为两个子领域:语义分析和句法分析。语义处理涉及为每个单词创建矢量表示。目的是使向量密集并且通过它们的向量使单词之间的关系清楚。 Word2Vec Skip-Gram模型就是这样一种语义词嵌入模型,它基于固定上下文窗口中词的共现,将词处理为向量。另一方面,句法处理处理从文本结构中收集文本信息的问题。语法处理的一种流行方法是依赖项解析,其中在短语或从句中的不同单词对之间显式绘制连接,其中一个单词修饰另一个单词。人类在处理文本时会同时使用语义和句法信息。因此,将语义和句法处理相结合对于医学文本分析很重要。单词之间的依赖关系定义了特定的医学概念,例如疾病和症状。本研究的目的是研究一种改变了的词向量模型-SD Skip-Gram模型,该模型结合了词之间的依赖性。将SD Skip-Gram模型与基本的Skip-Gram模型进行比较,以分析常用词以及医学疾病和症状概念。返回结果表明,SD Skip-Gram模型可以识别常见单词与基本Skip-Gram模型之间的关系,并且在识别相关疾病和症状概念方面比基本Skip-Gram模型更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号