【24h】

Automated Verbal-Pattern Extraction from Political News Articles using CAMEO Event Coding Ontology

机译:使用CAMEO事件编码本体从政治新闻文章中自动进行言语模式提取

获取原文

摘要

Structured metadata extraction from raw-text in different domains is gaining strong attention from the research communities and offering good applicable scenarios. In Political Science, these metadata play a significant role in studying and predicting intra-and inter-state relationships and often follows a certain structural representation. Our work exploits CAMEO (Conflict and Mediation Event Observations) ontology to extract events from political news articles. This event will be represented by metadata such as who-did-what-to-whom format. We face a number of challenges in this extraction process due to the incompleteness of CAMEO dictionaries. In particular, for some verb patterns appeared in articles we may not find a appropriate match in CAMEO verb dictionary (i.e 'what'). Our goal is to recommend new verb patterns by mining a large number of news articles to enhance the existing CAMEO verb dictionaries. For this, first we apply Association Rule Mining to find frequent verb patterns. Next, to assign categories of these verb patterns, we develop a novel algorithm based on Word-to-Vec model. Finally, we develop a prototype system for automatically recommending a number of useful verb patterns.
机译:从不同领域的原始文本中进行结构化元数据提取正受到研究界的广泛关注,并提供了良好的适用场景。在政治学中,这些元数据在研究和预测州内和州际关系中起着重要作用,并且通常遵循一定的结构表示形式。我们的工作利用CAMEO(冲突和调解事件观察)本体来从政治新闻文章中提取事件。该事件将由诸如“谁要做什么”的元数据表示。由于CAMEO词典的不完整,我们在提取过程中面临许多挑战。特别是,对于文章中出现的某些动词模式,我们可能在CAMEO动词词典中找不到适当的匹配项(即“ what”)。我们的目标是通过挖掘大量新闻文章来推荐新的动词模式,以增强现有的CAMEO动词词典。为此,首先我们应用关联规则挖掘来查找频繁的动词模式。接下来,为分配这些动词模式的类别,我们开发了一种基于Word-to-Vec模型的新颖算法。最后,我们开发了一个原型系统,用于自动推荐许多有用的动词模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号