Cleaning safety records using text mining algorithms.

机译：使用文本挖掘算法清理安全记录。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This Project aims to de-identify the safety records using Natural Language Processing. By removing identifying information, near miss records can be shared across industry participants. The strategy for removing identifying information was to remove proper nouns. Proper nouns were identified by part of speech tagging using NLTK and a list of proper nouns developed by the project. This project achieved high accuracy (98%) and reasonable precision (45%) on records in upper and lower case. The performance on records written in all upper case was significantly worse. The majority of errors were due to capitalization, spelling, uncommon words, maritime specific words, and titles. This document presents twelve approaches to improve algorithm performance.

机译：该项目旨在使用自然语言处理来取消安全记录的标识。通过删除标识信息，可以在行业参与者之间共享未命中记录。删除识别信息的策略是删除专有名词。通过使用NLTK的语音标记和项目开发的一系列专有名词来识别专有名词。该项目在大写和小写记录方面都达到了较高的准确性（98％）和合理的准确性（45％）。大写形式的记录性能明显较差。大部分错误是由于大写，拼写，不常见的单词，海事专用单词和标题引起的。本文档介绍了十二种改善算法性能的方法。

著录项

作者
Chauhan, Vaibhav.;
展开▼
作者单位

Lamar University - Beaumont.;

展开▼
授予单位 Lamar University - Beaumont.;
学科 Engineering Industrial.
学位 M.E.S.
年度 2012
页码 54 p.
总页数 54
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Safety Related Drug-Labelling Changes : Findings from Two Data Mining Algorithms. [J] . Reich L Drug safety: An international journal of medical toxicology and drug experience . 2004,第10期

机译：与安全相关的药品标签更改：两种数据挖掘算法的发现。
2. Sequential Pattern Mining Algorithm Based on Text Data: Taking the Fault Text Records as an Example [J] . Xinglong Yuan, Wenbing Chang, Shenghan Zhou, Sustainability . 2018,第11期

机译：基于文本数据的顺序模式挖掘算法-以故障文本记录为例
3. Text Data Mining of In-patient Nursing Records Within Electronic Medical Records Using KeyGraph [J] . Muneo Kushima, Kenji Araki, Muneou Suzuki, IAENG Internaitonal journal of computer science . 2011,第3期

机译：使用KeyGraph在电子病历中的住院护理记录中进行文本数据挖掘
4. Research on text data mining of hospital patient records within Electronic Medical Records [C] . Kushima Muneo, Araki Kenji, Suzuki Muneou, International Conference on Soft Computing and Intelligent Systems;International Symposium on Advanced Intelligent Systems . 2014

机译：电子病历中医院病历文本数据挖掘的研究
5. Prediction of cost overruns using ensemble methods in data mining and text mining algorithms. [D] . Ramesh, Prathiksha. 2014

机译：在数据挖掘和文本挖掘算法中使用集成方法预测成本超支。
6. Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London UK [O] . Natasha Chilman, Xingyi Song, Angus Roberts, 2021

机译：从心理健康电子健康记录的文本挖掘职业：使用南伦敦南伦敦英国南部临床记录互动搜索（CRIS）平台的自然语言处理方法
7. Text Mining in Health Records: Classification of Text to Facilitate Information Flow and Data Overview [O] . Rose Øystein 2007

机译：健康记录中的文本挖掘：文本分类以促进信息流和数据概述
8. EPA Should Strengthen Records Management on Clean Water Act Section 404 Permit Notification Reviews for Surface Coal Mining [R] . Gilbride, P., Barne-Weaver, E., Strasser, M. A., 2012

机译：美国环保署应加强对清洁水法案的记录管理404地表采煤许可通知审查

Cleaning safety records using text mining algorithms.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅