...
首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >Automatic Twitter Topic Summarization With Speech Acts
【24h】

Automatic Twitter Topic Summarization With Speech Acts

机译:带有语音行为的自动Twitter主题摘要

获取原文
获取原文并翻译 | 示例

摘要

With the growth of the social media service of Twitter, automatic summarization of Twitter messages (tweets) is in urgent need for efficient processing of the massive tweeted information. Unlike multi-document summarization in general, Twitter topic summarization must handle the numerous, short, dissimilar, and noisy nature of tweets. To address this challenge, we propose a novel speech act-guided summarization approach in this work. Speech acts characterize tweeters' communicative behavior and provide an organized view of their messages. Speech act recognition is a multi-class classification problem, which we solve by using word-based and symbol-based features that capture both the linguistic features of speech acts and the particularities of Twitter text. The recognized speech acts in tweets are then used to direct the extraction of key words and phrases to fill in templates designed for speech acts. Leveraging high-ranking words and phrases as well as topic information for major speech acts, we propose a round-robin algorithm to generate template-based summaries. Different from the extractive method adopted in most previous works, our summarization method is abstractive. Evaluated on two 100-topic datasets, the summaries generated by our method outperform two kinds of representative extractive summaries and rival human-written summaries in terms of explanatoriness and informativeness.
机译:随着Twitter社交媒体服务的增长,迫切需要对Twitter消息(推文)进行自动汇总,以有效地处理大量推文信息。与一般的多文档摘要不同,Twitter主题摘要必须处理推文的众多,简短,不同且嘈杂的性质。为了解决这一挑战,我们在这项工作中提出了一种新颖的言语行为引导总结方法。言语行为表征了高音扬声器的交流行为,并提供了其信息的组织化视图。言语行为识别是一个多类分类问题,我们通过使用基于单词和基于符号的特征来解决该问题,这些特征既捕获了言语行为的语言特征,又捕捉了Twitter文本的特殊性。然后,在推文中识别出的语音行为将用于指导关键词和短语的提取,以填充为语音行为设计的模板。利用主要语音行为的高级单词和短语以及主题信息,我们提出了一种循环算法来生成基于模板的摘要。与大多数以前的工作中采用的提取方法不同,我们的摘要方法是抽象的。在两个100个主题的数据集上进行评估后,我们的方法得出的摘要在解释性和信息性方面均优于两种代表性的摘要和与人工撰写的摘要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号