A Mixed Neural Network and Support Vector Machine Model for Tender Creation in the European Union TED Database

机译：欧盟TED数据库中招标创建的混合神经网络和支持向量机模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This research article proposes a new method of automatized text generation and subsequent classification of the European Union (EU) Tender Electronic Daily (TED) text documents into predefined technological categories of the dataset. The TED dataset provides information about the respective tenders includes features like Name of project, Title, Description, Types of contract, Common procurement vocabulary (CPV) code, and Additional CPV codes. The dataset is obtained from the SIMAP-Information system for the European public procurement website, which is comprised of tenders described in XML files. The dataset was preprocessed using tokenization, removal of stop words, removal of punctuation marks etc. We implemented a neural machine learning model based on Long Short-Term Memory (LSTM) nodes for text generation and subsequent code classification. Text generation means that given a single line or just two or three words of the title, the model generates the sequence of a whole sentence. After generating the title, the model predicts the main applicable CPV code for that title. The LSTM model reaches an accuracy of 97% for the text generation and 95% for code classification using Support Vector Machine (SVM). This experiment is a first step towards developing a system that based on TED data is able to auto-generate and code classify tender documents, easing the process of creating and disseminating tender information to TED and ultimately relevant vendors. The development and automation of this system will future vision and understand current undergoing projects and the deliveries by a SIMAP-Information system for European public procurement tenders organisation based on the tenders published by it.

机译：本研究文章提出了一种新的自动化文本生成和随后分类欧盟（欧盟）招标电子日（TED）案文文件进入预定义的数据集技术类别。 TED DataSet提供有关各个招标的信息包括项目名称，标题，描述，合同类型，常见采购词汇（CPV）代码和其他CPV代码。数据集是从欧洲公共采购网站的SIMAP-Information系统获得的，该网站由XML文件中描述的招标组成。使用令牌化，删除停止单词，删除标点符号等数据集。我们基于文本生成和后续代码分类的长短短期内存（LSTM）节点来实现神经机学习模型。文本生成意味着给定单线或仅为标题的两个或三个单词，模型生成整个句子的序列。生成标题后，模型预测该标题的主要适用CPV代码。使用支持向量机（SVM），LSTM模型为文本生成达到97％的准确性，并且代码分类为95％。该实验是开发一个基于TED数据的系统的第一步，该系统能够自动生成和代码分类招标文件，缓解创建和传播投标信息的过程和最终相关的供应商。该系统的开发和自动化将来的愿景和理解基于其发表的招标的欧洲公共采购招标组织的SIMAP信息系统正在进行的项目和交付。

著录项

来源
《International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management》|2019年|1(CD-ROM)|共7页
会议地点
作者
Sangramsing Kayte; Peter Schneider-Kamp;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G354-53;
关键词
Natural language processing; Natural language understanding; Named-entity recognition; Logistic regression; European union; Tender electronic daily; Common procurement vocabulary;

机译：自然语言处理;自然语言理解;命名实体识别;物流回归;欧洲联盟;嫩电子每日;常见的采购词汇;
入库时间 2022-08-20 20:17:38

相似文献

外文文献
中文文献
专利

1. Evaluation of insolvency in mutual credit unions by the models of artificial neural networks and support vector machines [J] . Isabel Cristina Gozer, Ant?nio Roberto Pereira leite de Albuquerque, Sadao Isotani, African Journal of Agricultural Research . 2014,第16期

机译：用人工神经网络和支持向量机模型评估相互信用联盟中的破产。
2. A new library to combine artificial neural networks and support vector machines with statistics and a database engine for application in environmental modeling [J] . Ralf Wieland, Wilfried Mirschel, Bernd Zbell, Environmental Modelling & Software . 2010,第4期

机译：一个新的库，它将人工神经网络和支持向量机与统计数据相结合，并提供一个数据库引擎以用于环境建模
3. Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines [J] . Rodriguez-Galiano V., Sanchez-Castillo M., Chica-Olmo M., Ore Geology Reviews: Journal for Comprehensive Studies of Ore Genesis and Ore Exploration . 2015,第Null期

机译：机器学习预测模型的矿物前景：对神经网络，随机森林，回归树和支持向量机的评估
4. A Mixed Neural Network and Support Vector Machine Model for Tender Creation in the European Union TED Database [C] . Sangramsing Kayte, Peter Schneider-Kamp International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management . 2019

机译：欧盟TED数据库中招标创建的混合神经网络和支持向量机模型
5. Brainprint: Identifying unique features of neural activity using cross-correlation, support vector machines and neural networks to evaluate its use as an authentication method. [D] . Ruiz Blondet, Maria Virginia. 2014

机译：大脑印记：使用互相关，支持向量机和神经网络来识别神经活动的独特特征，以评估其作为身份验证方法的用途。
6. Comparing Artificial Neural Networks General Linear Models and Support Vector Machines in Building Predictive Models for Small Interfering RNAs [O] . Kyle A. McQuisten, Andrew S. Peek 2009

机译：在构建小干扰RNA的预测模型中比较人工神经网络通用线性模型和支持向量机
7. A Mixed Neural Network and Support Vector Machine Model for Tender Creation in the European Union TED Database [O] . Sangramsing Kayte, Peter Schneider-Kamp 2019

机译：欧盟TED数据库中招标创建的混合神经网络和支持向量机模型
8. Carbon Nanotube Growth Rate Regression using Support Vector Machines and Artificial Neural Networks. [R] . Westing, N. M. 2014

机译：基于支持向量机和人工神经网络的碳纳米管生长速率回归。

A Mixed Neural Network and Support Vector Machine Model for Tender Creation in the European Union TED Database

摘要

著录项

相似文献

相关主题

期刊订阅