
Frequent Term based Text Summarization for Bahasa Indonesia

Abstract

A text summary helps readers understand the content of a text without reading it in full, and automatic text summarization makes producing such summaries easier. This paper presents a frequent-term-based text summarization system for Bahasa Indonesia, designed and implemented in Java. The system generates a summary of an input document by identifying and extracting its important sentences. It counts the term frequency of nouns and verbs, because these are considered the most representative of the text's content. It also integrates a statistical approach built on two features: the title of the news article and the location of the sentence. The generated summaries were compared with human-generated summaries, using precision, recall, and F-measure to evaluate their accuracy. Respondents also rated the quality of the system's summaries on a scale from 1 to 100. In the experiments, the system produced effective summaries, with an average F-measure of 78% at a compression rate of 30%. The average quality score given by respondents was 83.3.
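
The abstract describes the method only in outline, so the following is a minimal Java sketch under stated assumptions, not the authors' implementation. It scores each sentence by average term frequency, overlap with the title, and position, keeps the top-scoring fraction of sentences, and computes precision, recall, and F-measure against a reference summary. The class name `FrequentTermSummarizer`, the equal feature weights, the reading of "compression rate" as the fraction of sentences kept, and the comparison at sentence level are all assumptions. The sketch also counts every token, because it has no part-of-speech tagger to restrict the count to nouns and verbs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration class; none of these names come from the paper.
final class FrequentTermSummarizer {

    // Lowercase the text and split on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.toLowerCase().split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // Returns the indices of the selected sentences, in document order.
    static List<Integer> summarize(String title, List<String> sentences, double compressionRate) {
        // Document-wide term frequency. Assumption: all tokens are counted,
        // whereas the paper counts only nouns and verbs.
        Map<String, Integer> tf = new HashMap<>();
        List<List<String>> tokenized = new ArrayList<>();
        for (String s : sentences) {
            List<String> toks = tokenize(s);
            tokenized.add(toks);
            for (String t : toks) tf.merge(t, 1, Integer::sum);
        }
        Set<String> titleTerms = new HashSet<>(tokenize(title));
        int n = sentences.size();
        double[] score = new double[n];
        for (int i = 0; i < n; i++) {
            double freq = 0;
            int titleOverlap = 0;
            for (String t : tokenized.get(i)) {
                freq += tf.get(t);
                if (titleTerms.contains(t)) titleOverlap++;
            }
            int len = Math.max(1, tokenized.get(i).size());
            double position = 1.0 - (double) i / n; // earlier sentences score higher
            // Assumption: the three features are simply added with equal weight.
            score[i] = freq / len + titleOverlap + position;
        }
        // Assumption: compression rate = fraction of sentences kept (at least one).
        int k = Math.min(n, Math.max(1, (int) Math.round(compressionRate * n)));
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) order[i] = i;
        Arrays.sort(order, (a, b) -> Double.compare(score[b], score[a]));
        List<Integer> chosen = new ArrayList<>(Arrays.asList(order).subList(0, k));
        Collections.sort(chosen); // restore document order
        return chosen;
    }

    // Sentence-level precision, recall, and F-measure: {p, r, f}.
    static double[] precisionRecallF(Set<Integer> system, Set<Integer> reference) {
        Set<Integer> common = new HashSet<>(system);
        common.retainAll(reference);
        double p = system.isEmpty() ? 0 : (double) common.size() / system.size();
        double r = reference.isEmpty() ? 0 : (double) common.size() / reference.size();
        double f = (p + r == 0) ? 0 : 2 * p * r / (p + r);
        return new double[] {p, r, f};
    }

    public static void main(String[] args) {
        List<String> sentences = Arrays.asList(
                "Banjir melanda Jakarta pada hari Senin.",
                "Warga mengungsi ke tempat aman.",
                "Cuaca cerah di kota lain.");
        List<Integer> chosen = summarize("Banjir Jakarta", sentences, 0.3);
        for (int i : chosen) System.out.println(sentences.get(i));
    }
}
```

In a fuller version, a part-of-speech tagger for Bahasa Indonesia would filter the tokens to nouns and verbs before counting, and the feature weights would be tuned rather than fixed at one.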
