Automatic text summarization of Wikipedia articles

Abstract

The main objective of a text summarization system is to identify the most important information in a given text and present it to end users. In this paper, Wikipedia articles are given as input to the system, and extractive summaries are produced by identifying text features and scoring the sentences accordingly. The text is first pre-processed: sentences are tokenized and words are stemmed. The sentences are then scored using several text features. Two novel features are introduced: the citations present in the text and the identification of synonyms. These features, together with traditional ones, are used to score the sentences. A neural network then uses the scores to classify each sentence as belonging in the summary or not. The user can specify what percentage of the original text should appear in the summary. Scoring sentences based on citations is found to give the best results.
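
The pipeline described in the abstract (tokenize, stem, score by features, keep a user-chosen percentage) can be illustrated with a minimal Python sketch. This is not the authors' implementation, and it rests on several assumptions: citations are taken to be Wikipedia-style markers such as "[12]"; `crude_stem` is a hypothetical suffix stripper standing in for a real stemmer; a fixed weighted sum with illustrative weights `w_cite` and `w_freq` replaces the neural network classifier; and the synonym feature is omitted. All function names are made up for this example.

```python
import math
import re
from collections import Counter

# Assumption: citations appear as Wikipedia-style markers such as "[12]".
CITATION_RE = re.compile(r"\[\d+\]")
# A sentence is text up to ., ! or ?, plus any citation markers right after it.
# Trailing text without end punctuation is dropped by this naive tokenizer.
SENTENCE_RE = re.compile(r"[^.!?]+[.!?]+(?:\[\d+\])*")
# Tiny illustrative stopword list, not taken from the paper.
STOPWORDS = {"the", "a", "an", "of", "in", "to", "and", "is", "are",
             "it", "that", "for", "on", "as", "by", "with"}


def split_sentences(text):
    """Naive sentence tokenizer that keeps trailing citation markers."""
    return [s.strip() for s in SENTENCE_RE.findall(text) if s.strip()]


def crude_stem(word):
    """Hypothetical stand-in for a real stemmer: strip one common suffix."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def content_stems(sentence):
    """Lowercase, drop citation markers and stopwords, then stem."""
    words = re.findall(r"[a-z]+", CITATION_RE.sub(" ", sentence.lower()))
    return [crude_stem(w) for w in words if w not in STOPWORDS]


def score_sentences(sentences, w_cite=0.6, w_freq=0.4):
    """Weighted sum of two normalized features (weights are illustrative)."""
    stems = [content_stems(s) for s in sentences]
    counts = Counter(t for sent in stems for t in sent)
    # Frequency feature: mean document frequency of the sentence's stems.
    freq = [sum(counts[t] for t in sent) / len(sent) if sent else 0.0
            for sent in stems]
    # Citation feature: number of citation markers in the sentence.
    cites = [len(CITATION_RE.findall(s)) for s in sentences]
    max_freq = max(freq, default=0.0) or 1.0
    max_cites = max(cites, default=0) or 1
    return [w_cite * c / max_cites + w_freq * f / max_freq
            for c, f in zip(cites, freq)]


def summarize(text, percent):
    """Return the top `percent` of sentences by score, in original order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    keep = max(1, math.ceil(len(sentences) * percent / 100))
    scores = score_sentences(sentences)
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:keep])]
```

Calling `summarize(article_text, 30)` would return roughly the top 30% of sentences as a list. Giving the citation feature the larger weight mirrors the abstract's finding that citation-based scoring performed best, but the specific values here are arbitrary.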


Bibliographic details

Source
2015 International Conference on Communication, Information & Computing Technology | 2015 | pp. 1-4 | 4 pages
Conference location: Mumbai (IN)
Authors
Hingu Dharmendra; Shah Deep; Udmale Sandeep S.;
Author affiliation

Dept. of Comput. Eng. Inf. Technol., Veermata Jijabai Technol. Inst., Mumbai, India;

Original format: PDF
Text language: English
Keywords
Web sites; neural nets; text analysis; Wikipedia articles; automatic text summarization; neural network; sentence classification; sentence scoring; sentence tokenization; stemming operations; text feature identification; text preprocessing; Computers; Electronic publishing; Encyclopedias; Feature extraction; Internet; Neural networks; Frequency; Natural Language; Python; Text summarization;


