...
首页> 外文期刊>The Journal of Artificial Intelligence Research >Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction
【24h】

Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction

机译:通过弱监督的语义词典构造从航空安全事件报告中识别原因

获取原文
           

摘要

The Aviation Safety Reporting System collects voluntarily submitted reports on aviation safety incidents to facilitate research work aiming to reduce such incidents. To effectively reduce these incidents, it is vital to accurately identify why these incidents occurred. More precisely, given a set of possible causes, or shaping factors, this task of cause identification involves identifying all and only those shaping factors that are responsible for the incidents described in a report. We investigate two approaches to cause identification. Both approaches exploit information provided by a semantic lexicon, which is automatically constructed via Thelen and Riloff's Basilisk framework augmented with our linguistic and algorithmic modifications. The first approach labels a report using a simple heuristic, which looks for the words and phrases acquired during the semantic lexicon learning process in the report. The second approach recasts cause identification as a text classification problem, employing supervised and transductive text classification algorithms to learn models from incident reports labeled with shaping factors and using the models to label unseen reports. Our experiments show that both the heuristic-based approach and the learning-based approach (when given sufficient training data) outperform the baseline system significantly.
机译:航空安全报告系统收集自愿提交的关于航空安全事件的报告,以促进旨在减少此类事件的研究工作。为了有效地减少这些事件,至关重要的是要准确识别这些事件发生的原因。更准确地说,给定一组可能的原因或整形因素,原因识别任务涉及识别所有且仅负责报告中描述的事件的那些整形因素。我们研究了两种原因识别方法。两种方法都利用了语义词典提供的信息,该语义词典是通过Thelen和Riloff的Basilisk框架自动构建的,并对其进行了语言和算法修改。第一种方法使用简单的试探法标记报告,该试探法在报告中寻找在语义词典学习过程中获得的单词和短语。第二种方法重铸了将原因识别为文本分类问题的方法,它采用监督和转导性文本分类算法从标记有整形因子的事件报告中学习模型,并使用这些模型来标记看不见的报告。我们的实验表明,无论是基于启发式的方法还是基于学习的方法(在获得足够的训练数据的情况下)均明显优于基准系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号