首页> 外文会议>IEEE International Conference on Semantic Computing >On-Device Extractive Text Summarization
【24h】

On-Device Extractive Text Summarization

机译:设备上的提取文本摘要

获取原文

摘要

With increasing connectivity, there has been an exponential surge in the creation and availability of textual content in the form of news articles, blogs, social media posts and product reviews. A large portion of this data is consumed on mobile devices, and more recently, through wearables and smart speakers. Text summarization involves generating a brief description of a text, which captures the overall intention and the vital information being conveyed in its content. Common techniques for automatic text summarization follow extractive or abstractive approaches and involve large scale models with millions of parameters. While such models can be utilized in web or cloud-based applications, they are impractical for deployment on devices with limited storage and computational capabilities. In this paper, we propose a novel character-level neural architecture for extractive text summarization, with the model size reduced by 99.64% to 97.98% from existing methods, thus making it suitable for deployment on-device such as in mobiles, tabs and smart speakers. We tested the performance of our model on various benchmark datasets and compared it with several strong baselines and models. Despite using only a fraction of the space, our model outperformed the baselines and several state-of-the-art models, while coming close in performance with others. On-device text summarization remains largely an unexplored area, and our model's results show a promising approach towards building summarization models suitable for a constrained environment.
机译:随着连通性的增加,新闻文章,博客,社交媒体帖子和产品评论的形式创建和可用性的指数飙升。通过可穿戴设备和智能扬声器,在移动设备上消耗大部分这些数据,最近更新。文本摘要涉及生成文本的简要说明,该文本捕获了整体意图和在其内容中传达的重要信息。自动文本摘要的常用技术遵循提取或抽象方法,并涉及具有数百万参数的大规模模型。虽然这些模型可以用于基于Web或基于云的应用程序,但它们对于在具有有限的存储和计算能力的设备上部署是不切实际的。在本文中,我们提出了一种新的性格级神经结构,用于提取文本摘要,模型大小从现有方法减少了99.64%至97.98%,从而使其适用于设备,如手机,标签和智能等设备发言者。我们在各种基准数据集中测试了模型的性能,并将其与几个强大的基线和模型进行了比较。尽管只使用了空间的一小部分,我们的模型表现出基线和几种最先进的模型,同时与他人的性能接近。设备文本摘要仍然很大程度上是一个未开发的地区,我们的模型的结果表明了建立适合于受限制环境的摘要模型的有希望的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号