首页> 外文会议>Workshop on computational linguistics for literature >A Dictionary of Wisdom and Wit: Learning to Extract Quotable Phrases
【24h】

A Dictionary of Wisdom and Wit: Learning to Extract Quotable Phrases

机译:智慧词典和机智:学会提取引用短语

获取原文
获取外文期刊封面目录资料

摘要

Readers suffering from information overload have often turned to collections of pithy and famous quotations. While research on large-scale analysis of text reuse has found effective methods for detecting widely disseminated and famous quotations, this paper explores the complementary problem of detecting, from internal evidence alone, which phrases are quotable. These quotable phrases are memorable and succinct statements that people are likely to find useful outside of their original context. We evaluate quotable phrase extraction using a large digital library and demonstrate that an integration of lexical and shallow syntactic features results in a reliable extraction process. A study using a reddit community of quote enthusiasts as well as a simple corpus analysis further demonstrate the practical applications of our work.
机译:患有信息过载的读者经常转向精辟和着名的引用。虽然关于文本重复使用的大规模分析的研究已经找到了检测广泛传播和着名引用的有效方法,但本文探讨了检测的互补问题,从内部证据单独检测,哪些短语是引用的。这些引用的短语是令人难忘的和简洁的陈述,人们可能在原始背景之外发现有用。我们使用大数字库评估引用短语提取,并证明了词汇和浅句法特征的集成导致可靠的提取过程。使用Reddit Community的引用爱好者和简单的语料库分析进行了一项研究进一步展示了我们工作的实际应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号