SYSTEMS AND METHODS FOR AUTOMATICALLY GENERATING CONTENT SUMMARIES FOR TOPICS

Abstract

A method of automatically generating content summaries for topics includes receiving a taxonomy for a concept and a text corpus. The method further includes generating, from the text corpus and based on the taxonomy, an annotated dataset having term annotations corresponding to the concept; parsing the annotated dataset into a custom generated document object having a structured layout; determining features for the term annotations; and extracting snippets from the custom generated document object, where each snippet corresponds to a section of the custom generated document object. The method further includes scoring the snippets based on the features such that each snippet has a corresponding score, filtering out one or more of the snippets when one or more snippet filtering conditions are met, ranking the snippets into an ordered list for the concept based on the scores, and providing the ordered list to a user computing device.
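The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not name the features, the scoring formula, the filtering conditions, or the layout of the document object, so everything concrete below is an assumption made for illustration. In particular, the names `Snippet`, `annotate`, `parse_document`, and `summarize` are hypothetical, sections are assumed to begin at a line ending in a colon, the two features are assumed to be term frequency and the relative position of the first mention, and the only filter assumed is a minimum word count.

```python
import re
from dataclasses import dataclass


@dataclass
class Snippet:
    section: str        # title of the section the snippet came from
    text: str           # snippet text
    score: float = 0.0  # score derived from the term-annotation features


def annotate(text, taxonomy):
    """Return (term, start offset) for every taxonomy term found in text."""
    hits = []
    for term in taxonomy:
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, re.IGNORECASE):
            hits.append((term.lower(), match.start()))
    return hits


def parse_document(raw):
    """Split raw text into (title, body) sections.

    Assumption: a line ending in ':' opens a new section.
    """
    sections, title, body = [], "body", []
    for line in raw.splitlines():
        line = line.strip()
        if line.endswith(":"):
            if body:
                sections.append((title, " ".join(body)))
            title, body = line[:-1], []
        elif line:
            body.append(line)
    if body:
        sections.append((title, " ".join(body)))
    return sections


def summarize(corpus, taxonomy, min_words=5, top_k=3):
    """Annotate, extract, score, filter, and rank snippets for one concept."""
    snippets = []
    for raw in corpus:
        for section, text in parse_document(raw):
            hits = annotate(text, taxonomy)
            if not hits:
                continue  # the section never mentions the concept
            word_count = len(text.split())
            # Assumed feature 1: mentions per word in the section.
            frequency = len(hits) / word_count
            # Assumed feature 2: how early the first mention appears (0..1).
            first_mention = min(pos for _, pos in hits) / len(text)
            score = frequency + (1.0 - first_mention)
            snippets.append(Snippet(section, text, score))
    # Assumed filtering condition: drop snippets that are too short.
    kept = [s for s in snippets if len(s.text.split()) >= min_words]
    # Rank into an ordered list, highest score first.
    return sorted(kept, key=lambda s: s.score, reverse=True)[:top_k]
```

Calling `summarize(corpus, ["neural network"])` on a list of raw document strings returns the highest-scoring `Snippet` objects for that concept, ordered best first. A real system would presumably use a richer taxonomy (synonyms, related terms) and more features than this sketch assumes.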
