首页> 外文会议>International conference on intelligent data engineering and automated learning >Convolutional Neural Network for Core Sections Identification in Scientific Research Publications

【24h】

Convolutional Neural Network for Core Sections Identification in Scientific Research Publications

机译：卷积神经网络用于科研出版物核心部分的识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The overwhelming volume of data generated online continuous to grow at an exponential and unprecedented rate. Over 80% of such data is unstructured. Scientific research publications constitute a significant portion of such unstructured data. Systematic literature review (SLR) activity is a rigorous and challenging process. The key challenge in SLR is the automatic extraction of the relevant data from the sheer volume of research publications. Lack of a unified framework has been identified as the key problem. A canonical model, based on the structure of the papers was proposed as the framework for data extraction purposes in SLR. Implemented as a classification problem, traditional machine learning models were used to realise the canonical model. A good accuracy was reported in these traditional models. However, there is room for improvement. This paper presents the result of the work on the same problem using convolutional neural network (CNN), which is more sophisticated (deeper). The results show an improvement over the traditional machine learning models with an accuracy of 85%. Unlike the previous CNN NLP works, this work also demonstrates the application of CNN on a bigger NLP dataset such as the data from the scientific research publications. The result also shows that the CNN performs even better in NLP tasks with bigger datasets.

机译：在线生成的压倒性数据量正以指数级和空前的速度持续增长。超过80％的此类数据是非结构化的。科学研究出版物构成了此类非结构化数据的重要部分。系统的文献综述（SLR）活动是一个严格而具有挑战性的过程。 SLR的主要挑战是从庞大的研究出版物中自动提取相关数据。缺乏统一框架已被确定为关键问题。提出了一种基于论文结构的典范模型作为SLR中数据提取目的的框架。作为分类问题，使用传统的机器学习模型来实现规范模型。在这些传统模型中报告了良好的准确性。但是，仍有改进的空间。本文介绍了使用更复杂（更深入）的卷积神经网络（CNN）对同一问题进行研究的结果。结果表明，与传统的机器学习模型相比，其准确度达到了85％。与以前的CNN NLP工作不同，该工作还演示了CNN在更大的NLP数据集（例如来自科研出版物的数据）中的应用。结果还表明，CNN在具有更大数据集的NLP任务中表现更好。

著录项

来源
《International conference on intelligent data engineering and automated learning 》|2019年|265-273|共9页
会议地点
作者
Bello Aliyu Muhammad; Rahat Iqbal; Anne James; Dianabasi Nkantah;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data mining; Natural language processing; Machine learning; Neural network; Systematic literature review (SLR);

机译：数据挖掘;自然语言处理;机器学习;神经网络;系统文献综述（SLR）;

相似文献

外文文献
中文文献
专利

1. Cascade convolutional neural network-long short-term memory recurrent neural networks for automatic tonal and nontonal preclassification-based Indian language identification [J] . China Bhanja Chuya, Laskar Mohammad A., Laskar Rabul H. Expert Systems . 2020 ,第5期

机译：级联卷积神经网络长短期内存经常性神经网络，用于自动色调和非统计学预分配的印度语言识别
2. Optimizing convolutional neural networks to perform semantic segmentation on large materials imaging datasets: X-ray tomography and serial sectioning [J] . Stan Tiberiu, Thompson Zachary T., Voorhees Peter W. Materials Characterization . 2020 ,第期

机译：优化卷积神经网络在大型材料成像数据集上执行语义分割：X射线断层扫描和串联切片
3. Semantic segmentation of the multiform proximal femur and femoral head bones with the deep convolutional neural networks in low quality MRI sections acquired in different MRI protocols [J] . Memis Abbas, Varli Songul, Bilgili Fuat Computerized Medical Imaging and Graphics: The Official Jounal of the Computerized Medical Imaging Society . 2020 ,第期

机译：不同MRI协议中获取的低质量MRI部分中的多种近端股骨和股骨头骨骼和股骨头骨骼的语义分割
4. Convolutional Neural Network for Core Sections Identification in Scientific Research Publications [C] . Bello Aliyu Muhammad, Rahat Iqbal, Anne James, International Conference on Intelligent Data Engineering and Automated Learning . 2019

机译：科学研究出版物核心部分核心神经网络
5. Programmable Manycore Accelerator for Machine Learning, Convolution Neural Network and Binary Neural Network [D] . Kulkarni, Adwaya Amey. 2017

机译：面向机器学习，卷积神经网络和二进制神经网络的可编程Manycore加速器
6. CORENup: a combination of convolutional and recurrent deep neural networks for nucleosome positioning identification [O] . Domenico Amato, Giosue’ Lo Bosco, Riccardo Rizzo 2020

机译：Corenup：卷积和反复性深神经网络的组合核心定位鉴定
7. CORENup: a combination of convolutional and recurrent deep neural networks for nucleosome positioning identification [O] . Domenico Amato, Giosue’ Lo Bosco, Riccardo Rizzo 2020

机译：Corenup：卷积和反复性深神经网络的组合核心定位鉴定

Convolutional Neural Network for Core Sections Identification in Scientific Research Publications

摘要

著录项

相似文献

相关主题

期刊订阅