首页> 外文期刊>Knowledge-Based Systems >Multi-documents Automatic Abstracting based on text clustering and semantic analysis
【24h】

Multi-documents Automatic Abstracting based on text clustering and semantic analysis

机译:基于文本聚类和语义分析的多文档自动摘要

获取原文
获取原文并翻译 | 示例
       

摘要

A method of realization of multi-documents Automatic Abstracting based on text clustering and semantic analysis is brought forward, aimed at overcoming shortages of some current methods about multi-documents. The method makes use of semantic analysis and can realize Automatic Abstracting of multi-documents. The algorithm of twice word segmentation based on the title and first-sentences in paragraphs is brought forward. Its precision and recall is above 95%. For a specific domain on plastics, an Automatic Abstracting system named TCAAS is implemented. The precision and recall of multi-document's Automatic Abstracting is above 75%. And experiments do prove that it is feasible to use the method to develop a domain Automatic Abstracting system, which is valuable for further study in more depth.
机译:提出了一种基于文本聚类和语义分析的多文档自动摘录的实现方法,旨在克服目前有关多文档的一些方法的不足。该方法利用语义分析,可以实现多文档的自动抽象。提出了基于段落标题和第一句的两次分词算法。其精度和召回率均在95%以上。对于塑料的特定领域,实施了名为TCAAS的自动提取系统。多文档自动摘要的准确性和召回率超过75%。实验证明,采用该方法开发领域自动抽象系统是可行的,对进一步深入研究具有参考价值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号