首页> 外文会议>International Conference on Future Computer and Communication >A Hybrid Approach to Semi-supervised Named Entity Recognition in Health, Safety and Environment Reports
【24h】

A Hybrid Approach to Semi-supervised Named Entity Recognition in Health, Safety and Environment Reports

机译:一种混合方法,可以在健康,安全和环境报告中半监督的名为实体认可

获取原文

摘要

In the last few years, text mining have become the area of interests in Natural Language Processing (NLP). They share a similar idea i.e. to extract important facts from unstructured text which later help to populate database entries. Name Entity Recognition (NER) is one of the main task needed to develop text mining systems in which it is used to identify and classify entities in the text into predefined categories such as the names of persons, organizations, locations, dates, times, quantities, monetary values, percentages, etc. This paper focuses on studying the optimum solution to perform NER. To achieve our target, Health Safety and Environment (HSE) reports available from the University Teknologi PETRONAS (UTP) are chosen as the case study. The UTPpsilas HSE reports are the investigation reports which contain the information on incidents and accidents occurred during the daily operations. Many algorithms have been reported for NER ranging from simple statistical methods to advanced Natural language Processing (NLP) methods. This paper describes the possibility to apply Link Grammar (LG) and Basilisk Algorithm in NER.
机译:在过去几年中,文本挖掘已成为自然语言处理的兴趣领域(NLP)。它们共享类似的想法,即将重要事实从非结构化文本中提取,稍后帮助填充数据库条目。名称实体识别(ner)是开发文本挖掘系统所需的主要任务之一,其中它用于将文本中的文本中的实体标识到预定义类别,例如人员,组织,位置,日期,次数,数量,数量的名称,货币价值观,百分比等。本文重点研究了研究执行ner的最佳解决方案。为了实现我们的目标,选择从Teknologi Petronas(UTP)大学提供的健康安全和环境(HSE)报告作为案例研究。 UTPPSILAS HSE报告是调查报告,其中包含日常行动中发生的事故和事故的信息。从简单的统计方法到高级自然语言处理(NLP)方法,已经报道了许多算法。本文介绍了在网中应用链路语法(LG)和Basileisk算法的可能性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号