首页> 外国专利> METHOD OF SUMMARIZING TEXT WITH SENTENCE EXTRACTION

METHOD OF SUMMARIZING TEXT WITH SENTENCE EXTRACTION

机译:用句子提取汇总文本的方法

摘要

A method for summarizing text with sentence extraction including steps as follows. Sentences are extracted from a document including text by a natural language processing (NLP) based feature extractor. A word vector set with respect to each of the sentences is generated by a processor. The word vector set with respect to each of the sentences is used to generate a n-grams vector set and a phrase-n vector set with respect to each of the sentences. A word score representing similarity between the word vector sets, a n-grams score representing similarity between the n-grams vector sets, and a phrase-n score representing similarity between the phrase-n vector sets are computed. The word, n-grams, and phrase-n scores are combined to compute an edge score. Text features are selected from the sentences using the edge scores of the sentences, so as to output a summary of the document.
机译:一种汇总文本的方法,句子提取包括如下的步骤。从基于自然语言处理(NLP)的特征提取器的文档中提取了句子。关于每个句子的一个字矢量集由处理器生成。关于每个句子的字向量集用于生成关于每个句子的n-grams向量集和短语-n向量集合。表示单词评分表示单词矢量集之间的相似性,表示表示n-gram向量集之间的相似性的n克分数,以及表示短语-n向量集之间的相似性的短语-n得分。组合单词,n-gram和短语分数以计算边缘分数。文本功能选自句子的句子中的句子,以便输出文档的摘要。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号