ISUTD: Intelligent System for Urdu Text De-Summarization

机译：ISUTD：核心核心文本缩减摘要的智能系统

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text De-Summarization is a method of increasing the document and explains the substantial point of the text. It is very rough assignment for humans to manually explain the central subject from the large article. De- Summarization can be separating into two branches as Abstractive and Extractive approaches. Extractive accumulates the imperative paragraph or sentence from the original document and presents them as an explanation. Urdu inherits a lot of vocabulary from Arabic, Persian and the native languages of South Asia. Due to this effect, Urdu has a complex morphology. In terms of syntax, it has a relatively free word order (Subject, Object, and Verb). Despite spoken by millions of people, Urdu is an under-resourced language in terms of available computational resources. We extent the single document extractive de-summarization methodology for Urdu based on the sentence weight algorithm especially for the news, sports, and health etc. topics. We encapsulate the manuscript by preprocessing (sentence segmentation, tokenization, stop words and lemmatization) and apply sentence weight algorithm.

机译：文本失败是一种增加文档的方法，并解释了文本的实质性点。人类是非常粗略的分配，用于手动解释大型文章的中心科目。扩展可以分为两个分支，作为抽象和提取方法。 Extractic累积了原始文件中的命令段落或句子，并将其作为解释。乌尔都语继承了来自阿拉伯语，波斯和南亚母语的大量词汇。由于这种效果，乌尔都语具有复杂的形态。在语法方面，它具有相对自由的单词顺序（主题，对象和动词）。尽管达到了数百万人，但乌尔都语是一种资源不足的语言，就可用的计算资源而言。基于句子权重算法，我们为乌尔都语的单一文档提取除序方法特别适用于新闻，体育和健康等主题。我们通过预处理（句子分割，标记化，停止单词和lemmatization）来封装稿件并应用句子权重算法。

著录项

来源
《International Conference on Engineering and Emerging Technologies》|2019年|311p|共5页
会议地点
作者
Muhammad Wasif Bhatti; Muhammad Aslam;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TB1-53;
关键词
Feature extraction; Automobiles; Tokenization; White spaces; Data mining; Computer science; Morphology;

机译：特征提取;汽车;牌匾;白色空间;数据挖掘;计算机科学;形态学;

相似文献

外文文献
中文文献
专利

1. Urdu Nasta'liq text recognition system based on multi-dimensional recurrent neural network and statistical features [J] . Naz Saeeda, Umar Arif I., Ahmad Riaz, Neural computing & applications . 2017,第2期

机译：基于多维经常性神经网络和统计特征的乌尔都语Nasta'liq文本识别系统
2. The Potential Impact of Intelligent Systems for Mobile Health Self-Management Support: Monte Carlo Simulations of Text Message Support for Medication Adherence [J] . Piette John D., Farris Karen B., Newman Sean, Annals of behavioral medicine : . 2015,第1期

机译：智能系统对移动健康自我管理支持的潜在影响：用于药物依从性的文本消息支持的蒙特卡洛模拟
3. Decision Making in Intelligent Training-Testing Systems Based on Mixed Diagnostic Texts [J] . A. E. Yankovskaya, M. E. Semenov Scientific & Technical Information Processing . 2013,第6期

机译：基于混合诊断文本的智能训练系统的决策
4. ISUTD: Intelligent System for Urdu Text De-Summarization [C] . Muhammad Wasif Bhatti, Muhammad Aslam International Conference on Engineering and Emerging Technologies . 2019

机译：ISUTD：乌尔都语文本摘要的智能系统
5. Environnement materiel et logiciel pour le developpement de systemes intelligents (French text). [D] . Ranger, Jean-Marc. 2000

机译：用于开发智能系统的硬件和软件环境（法语）。
6. The Potential Impact of Intelligent Systems for Mobile Health Self-Management Support: Monte Carlo Simulations of Text Message Support for Medication Adherence [O] . John D. Piette, Karen B. Farris, Sean Newman, -1

机译：智能系统对移动健康自我管理支持的潜在影响：用于药物依从性的文本消息支持的蒙特卡洛模拟
7. USAD: An Intelligent System for Slang and Abusive Text Detection in PERSO-Arabic-Scripted Urdu [O] . Nauman Ul Haq, Mohib Ullah, Rafiullah Khan, 2020

机译：USAD：Perso-Arabic脚本核武器中的俚语和滥用文本检测智能系统

ISUTD: Intelligent System for Urdu Text De-Summarization

摘要

著录项

相似文献

相关主题

期刊订阅