Generation of Original Text with Text Mining and Deep Learning Methods for Turkish and Other Languages

机译：通过土耳其和其他语言的文本挖掘和深度学习方法生成原始文本

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The amount of content on the web has increased dramatically since the Internet began providing users with the ability to produce content. Initial work on original text production has aimed at publishing the given data by putting in a certain mold. The most obvious example of this is the analysis reports on sporting events. However, preparing an original text compiled with general information about a subject has become a subject of interest to scientists as well. Although Neural Networks and Markov models were used previously for original text production, the original text generation process and comparison of the success rates weren't done using the Turkish language and the academic publication data repository dataset. In this study, it was tried to create summary information/original content about a specific subject by using Wikipedia TR for the Turkish language and the data pool created with hundreds of thousands of academic publications. In the study, texts were produced with Markov Model and LSTM, which were previously proposed, and the results are comparatively shared in detail. In the evaluation study, the performance of the proposed method was examined, and the correctness of the techniques was evaluated concerning syntactic accuracy and semantic preservation. The results are evaluated by presenting a mixture of original and machine-generated texts to the actual user for the success test of the proposed method. The success rate of the results is calculated with accuracy, recall, and f-measure. The results are very promising because it has been observed that the method can produce accurate and quality representations.

机译：自从Internet开始为用户提供产生内容的能力以来，Web上的内容量已急剧增加。原始文本制作的初步工作旨在通过放入特定模型来发布给定数据。最明显的例子是关于体育赛事的分析报告。然而，准备用有关该主题的一般信息汇编的原始文本也已成为科学家感兴趣的主题。尽管以前曾使用神经网络和马尔可夫模型来制作原始文本，但是并没有使用土耳其语语言和学术出版物数据存储集数据集来完成原始文本生成过程和成功率的比较。在这项研究中，尝试通过使用土耳其语的Wikipedia TR和由成千上万的学术出版物创建的数据库来创建有关特定主题的摘要信息/原始内容。在这项研究中，使用先前提出的马尔可夫模型和LSTM编写了文本，并比较详细地共享了结果。在评估研究中，检查了所提方法的性能，并评估了该技术在句法准确性和语义保留方面的正确性。通过将原始文本和机器生成的文本的混合物呈现给实际用户来评估所提出方法的成功性，从而对结果进行评估。结果的成功率是通过准确性，召回率和f度量来计算的。结果是非常有希望的，因为已经观察到该方法可以产生准确和高质量的表示。

著录项

来源
《International Conference on Artificial Intelligence and Data Processing》|2018年|1-9|共9页
会议地点 Malatya(TR)
作者
Emre DOĞAN; Buket KAYA; Ahmet MÜNGEN;
展开▼
作者单位

Institute of Science Firat University Elazig/Turkey;

OSB Maden Vocational Higher School Firat University Elazig/Turkey;

Computer Engineering Firat University Elazig/Turkey;

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Markov processes; Internet; Production; Encyclopedias; Natural language processing; Electronic publishing;

机译：马尔可夫过程；互联网;生产;百科全书自然语言处理；电子出版;

相似文献

外文文献
中文文献
专利

1. A systematic review of text classification research based on deep learning models in Arabic language [J] . Ahlam Wahdan, Sendeyah AL Hantoobi, Said A. Salloum, International Journal of Electrical and Computer Engineering . 2020,第6期

机译：基于阿拉伯语深度学习模型的文本分类研究系统综述
2. SicknessMiner: a deep-learning-driven text-mining tool to abridge disease-disease associations [J] . Rosário-Ferreira Nícia, Guimar?es Victor, Costa Vítor S., BMC Bioinformatics . 2021,第1期

机译：疾病：一个深受学习驱动的文本挖掘工具，用于缩短疾病疾病协会
3. SENTIMENT MINING AND ANALYSIS OVER TEXT CORPORA VIA COMPLEX DEEP LEARNING NEURAL ARCHITECTURES [J] . TERESA ALCAMO, ALFREDO CUZZOCREA, GIOVANNI PILATO, Journal of Data Intelligence . 2021,第4期

机译：通过复杂的深度学习神经结构对文本语料库的情感挖掘和分析
4. Generation of Original Text with Text Mining and Deep Learning Methods for Turkish and Other Languages [C] . Emre DO?AN, Buket KAYA, Ahmet MüNGEN International Conference on Artificial Intelligence and Data Processing . 2018

机译：用文本挖掘和土耳其语和其他语言的文本挖掘和深入学习方法的生成
5. Transfer Learning: Bridging the Gap Between Deep Learning and Domain-Specific Text Mining [D] . Cheng, Chaoran. 2020

机译：转移学习：弥合深度学习与域特定文本挖掘之间的差距
6. A Text Mining Pipeline Using Active and Deep Learning Aimed at Curating Information in Computational Neuroscience [O] . Matthew Shardlow, Meizhi Ju, Maolin Li, -1

机译：基于主动和深度学习的文本挖掘管道旨在计算神经科学中的信息管理
7. Emotion Correlation Mining Through Deep Learning Models on Natural Language Text [O] . Xinzhi Wang, Luyao Kou, Vijayan Sugumaran, 2020

机译：通过自然语言文本深入学习模型的情感相关挖掘

Generation of Original Text with Text Mining and Deep Learning Methods for Turkish and Other Languages

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅