首页> 美国卫生研究院文献>other >Ontology-Guided Feature Engineering for Clinical Text Classification

【2h】

Ontology-Guided Feature Engineering for Clinical Text Classification

机译：临床文本分类的本体论引导特征工程

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this study we present novel feature engineering techniques that leverage the biomedical domain knowledge encoded in the Unified Medical Language System (UMLS) to improve machine-learning based clinical text classification. Critical steps in clinical text classification include identification of features and passages relevant to the classification task, and representation of clinical text to enable discrimination between documents of different classes. We developed novel information-theoretic techniques that utilize the taxonomical structure of the Unified Medical Language System (UMLS) to improve feature ranking, and we developed a semantic similarity measure that projects clinical text into a feature space that improves classification. We evaluated these methods on the 2008 Integrating Informatics with Biology and the Bedside (I2B2) obesity challenge. The methods we developed improve upon the results of this challenge’s top machine-learning based system, and may improve the performance of other machine-learning based clinical text classification systems. We have released all tools developed as part of this study as open source, available at

机译：在这项研究中，我们提出了新颖的特征工程技术，利用统一医疗语言系统（UMLS）编码的生物医学域知识来改善基于机器学习的临床文本分类。临床文本分类中的关键步骤包括识别与分类任务相关的特征和段落，以及临床文本的表示，以实现不同类别的文件之间的歧视。我们开发了利用统一医疗语言系统（UMLS）的分类学结构来改善特征排名的新型信息 - 理论技术，并开发了一个语义相似度措施，将临床文本投入到改善分类的特征空间中。我们在2008年将这些方法与生物学和床头（I2B2）肥胖挑战进行了评估了2008年集成信息学。我们制定了对该挑战的最高机器学习系统的结果的提高，并可以提高基于机器学习的临床文本分类系统的性能。我们已发布作为本研究的一部分开发的所有工具，如开源，可用

著录项

期刊名称 other
作者
Vijay N. Garla; Cynthia Brandt;
展开▼
作者单位

展开▼
年(卷),期 -1(45),5
年度 -1
页码 992–998
总页数 18
原文格式 PDF
正文语种
中图分类
关键词
Natural Language Processing Document Classification Semantic Similarity Feature Selection Kernel Methods Information Gain Information Content;

机译：自然语言处理;文档分类;语义相似度;特征选择;内核方法;信息增益;信息内容;

相似文献

外文文献
中文文献
专利

1. Ontology-guided feature engineering for clinical text classification [J] . GarlaV.N., BrandtC. Journal of biomedical informatics. . 2012,第5期

机译：本体指导的特征工程用于临床文本分类
2. Identifying Clinical Terms in Medical Text Using Ontology-Guided Machine Learning [J] . Aryan Arbabi, David R Adams, Sanja Fidler, JMIR Medical Informatics . 2019,第2期

机译：使用本体导向机学习识别医疗文本中的临床术语
3. An ensemble scheme based on language function analysis and feature engineering for text genre classification [J] . Aytuğ Onan Journal of Information Science . 2018,第1期

机译：基于语言功能分析和特征工程的文本体裁分类集成方案
4. Clinical Text Classification with Word Embedding Features vs. Bag-of-Words Features [C] . Yijun Shao, Stephanie Taylor, Nell Marshall, IEEE International Conference on Big Data . 2018

机译：具有词嵌入功能的临床文本分类与词袋功能
5. Text Detection in Natural Scenes and Technical Diagrams with Convolutional Feature Learning and Cascaded Classification. [D] . Zhu, Siyu. 2016

机译：具有卷积特征学习和级联分类的自然场景和技术图中的文本检测。
6. Clinical text classification with rule-based features and knowledge-guided convolutional neural networks [O] . Liang Yao, Chengsheng Mao, Yuan Luo 2019

机译：具有基于规则的功能和知识导向的卷积神经网络的临床文本分类
7. Multi-Class Imbalance in Text Classification: A Feature Engineering Approach to Detect Cyberbullying in Twitter [O] . Bandeh Ali Talpur, Declan O’Sullivan 2020

机译：文本分类中的多级不平衡：一种检测Twitter中的网络欺凌的特征工程方法

Ontology-Guided Feature Engineering for Clinical Text Classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅