首页> 外文会议>International Conference on Social Networks Analysis, Management and Security >A Supervised Machine Learning Approach for Events Extraction out of Arabic Tweets
【24h】

A Supervised Machine Learning Approach for Events Extraction out of Arabic Tweets

机译:阿拉伯语推文的事件提取的监督机器学习方法

获取原文

摘要

Tweets provide a continuous update on daily events, however they are noisy text, personalized and challenging to be understood by machines. This shows a need for event extraction and representation approaches. This research describes a state-of-the-art supervised machine learning approach for extracting events out of Arabic tweets. The proposed approach focuses on three research tasks: Task 1: Event Trigger Extraction, Task 2: Event Time Expression Extraction, Task 3: Event Type Identification. The proposed approach was evaluated on a dataset of 2k Arabic tweets and the evaluation results were promising. The approach performance was compared to an unsupervised rule-based approach from previous work using the same dataset. Results show that the proposed approach outperforms the unsupervised rule-based approach in tasks T1: event trigger extraction (F-1=92.6 vs. F-1=78.7) and T2: event time expression extraction (F-1=92.8 vs. F-1=88.35), whereas is acting relatively worse in T3: event type identification (Accuracy=80.1 vs. Accuracy=95.9).
机译:Tweets在日常活动中提供了连续的更新,但是它们是嘈杂的文本,是通过机器理解的个性化和具有挑战性的。这表明需要事件提取和表示方法。该研究描述了一种用于从阿拉伯语推文中提取事件的最先进的监督机器学习方法。所提出的方法侧重于三个研究任务:任务1:事件触发提取,任务2:事件时间表达提取,任务3:事件类型识别。拟议的方法在2K阿拉伯语推文的数据集上进行了评估,评估结果很有前景。将方法性能与使用相同数据集的先前工作的无监督规则的方法进行比较。结果表明,所提出的方法优于任务T1:事件触发提取(F-1 = 92.6对F-1 = 78.7)和T2:事件时间表达提取(F-1 = 92.8与F.)突出的基于规则的方法-1 = 88.35),而在T3中相对较差:事件类型识别(精度= 80.1与精度= 95.9)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号