...
【24h】

Argument Extraction from News, Blogs, and the Social Web

机译:从新闻,博客和社交网络中提取参数

获取原文
获取原文并翻译 | 示例

摘要

Argument extraction is the task of identifying arguments, along with their components in text. Arguments can be usually decomposed into a claim and one or more premises justifying it. Among the novel aspects of this work is the thematic domain itself which relates to Social Media, in contrast to traditional research in the area, which concentrates mainly on law documents and scientific publications. The huge increase of social media communities, along with their user tendency to debate, makes the identification of arguments in these texts a necessity. Argument extraction from Social Media is more challenging because texts may not always contain arguments, as is the case of legal documents or scientific publications usually studied. In addition, being less formal in nature, texts in Social Media may not even have proper syntax or spelling. This paper presents a two-step approach for argument extraction from social media texts. During the first step, the proposed approach tries to classify the sentences into "sentences that contain arguments" and "sentences that don't contain arguments". In the second step, it tries to identify the exact fragments that contain the premises from the sentences that contain arguments, by utilizing conditional random fields. The results exceed significantly the base line approach, and according to literature, are quite promising.
机译:参数提取是识别参数及其在文本中的组成部分的任务。通常可以将论点分解为一个索赔,并为一个或多个前提辩护。与该领域的传统研究相反,该领域的创新领域包括与社交媒体相关的主题领域本身,该领域主要集中于法律文件和科学出版物。社交媒体社区的大量增加,以及其用户辩论的趋势,使得在这些文本中辨别论点成为必要。从社交媒体中提取参数更具挑战性,因为文本可能并不总是包含参数,就像通常研究的法律文件或科学出版物一样。此外,社交媒体中的文本性质上不太正式,甚至可能没有正确的语法或拼写。本文提出了一种从社交媒体文本中提取参数的两步方法。在第一步中,建议的方法尝试将句子分为“包含自变量的句子”和“不包含自变量的句子”。在第二步中,它尝试利用条件随机字段从包含自变量的句子中识别出包含前提的确切片段。结果大大超过了基线方法,并且根据文献,这是很有希望的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号