首页> 外文会议>International Conference on Information Technology Research >Text Summarization for Tamil Online Sports News Using NLP
【24h】

Text Summarization for Tamil Online Sports News Using NLP

机译:使用NLP的泰米尔在线体育新闻的文本摘要

获取原文

摘要

Text summarization plays an important problem in natural language understanding and information retrieval. Automatic text summarization get much more attention by people presently because it is efficiently and effectively serve time in decision making process even for day to day life. Presently deep learning models get more attention than the traditional approaches. The primary objective of this research work is to propose a methodology to address the problem of summarization for Tamil sports news which can automatically create extractive summary for the news data with the use of Natural Language Processing (NLP) and a generic stochastic artificial neural network. Features such as sentence position, sentence position related to paragraph, number of named entities, term frequency and inverse document frequency and Number of numerals are employed to construct the feature matrix for each sentence and Restricted Boltzmann Machine is used to improve those features while enhancing the accuracy without loosing the main idea of the text. Experimentation is carried out using Online Tamil sports news and ROUGE tool kit is used to evaluate the recall, precision and F-measure for the summary generated by both the human experts and the system.
机译:文本摘要在自然语言理解和信息检索中起着重要问题。现在,自动文本摘要目前通过人们更加关注,因为它在日常生活中甚至可以在决策过程中有效和有效地服务。目前深入学习模型比传统方法更加关注。本研究工作的主要目标是提出一种方法来解决泰米尔体育新闻的摘要问题,这些问题可以通过使用自然语言处理(NLP)和通用随机人工神经网络来自动创建新闻数据的提取摘要。句子位置等特征,与段落相关的句子位置,命名实体的数量,术语频率和逆文档频率和数字的数量来构造每个句子的特征矩阵,而受限的boltzmann机器用于改善这些功能,同时增强这些功能准确性而不减少文本的主要思想。实验使用在线泰米尔泰米尔体育新闻,Rouge工具套件用于评估人类专家和系统产生的摘要的召回,精确和F测量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号