A Data Mining System Based on SQL Queries and UDFs for Relational Databases

机译：基于SQL查询和关系数据库的UDF的数据挖掘系统

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Most research on data mining has proposed algorithms and optimizations that work on flat files, outside a DBMS, mainly due to the following reasons. It is easier to develop efficient algorithms in a traditional programming language. The integration of data mining algorithms into a DBMS is difficult given its relational model foundation and system architecture. Moreover, SQL may be slow and cumbersome for numerical analysis computations. Therefore, data mining users commonly export data sets outside the DBMS for data mining processing, which creates a performance bottleneck and eliminates important data management capabilities such as query processing and security, among others (e.g. concurrency control and fault tolerance). With that motivation in mind, we developed a novel system based on SQL queries and User-Defined Functions (UDFs) that can directly analyze relational tables to compute statistical models, storing such models as relational tables as well. Most algorithms have been optimized to reduce the number of passes on the data set;. Our system can analyze large and high dimensional data sets faster than external data mining tools.

机译：大多数关于数据挖掘的研究都提出了在DBMS之外的平面文件上工作的算法和优化，主要原因是以下原因。更容易以传统的编程语言开发高效的算法。给出了其关系模型基础和系统架构，难以将数据挖掘算法集成到DBMS中。此外，对于数值分析计算，SQL可能是缓慢和繁琐的。因此，数据挖掘用户在DBMS之外的通常导出数据集以进行数据挖掘处理，其创建性能瓶颈，并消除了诸如查询处理和安全性的重要数据管理能力（例如，并发控制和容错）。通过考虑到这一动机，我们开发了一种基于SQL查询和用户定义的功能（UDFS）的新型系统，可以直接分析关系表来计算统计模型，也可以将这些模型存储为关系表。大多数算法已被优化以减少数据集上的通行证数量;我们的系统可以分析大而高维数据集比外部数据挖掘工具。

著录项

来源
《ACM international conference on information and knowledge management》|2011年||共4页
会议地点
作者
Carlos Ordonez; Carlos Garcia-Alvarado;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词
Algorithms; Languages; Performance; Theory;

机译：算法;语言;性能;理论;

相似文献

外文文献
中文文献
专利

1. An adaptive spark-based framework for querying large-scale NoSQL and relational databases [J] . Eman Khashan, Ali Eldesouky, Sally Elghamrawy PLoS One . 2021,第8期

机译：用于查询大型NoSQL和关系数据库的自适应火花框架
2. Quality-Based SQL: Specifying Information Quality in Relational Database Queries [J] . Parssian Amir, Yeoh William, Ee Mong Shan Computer . 2015,第9期

机译：基于质量的SQL：在关系数据库查询中指定信息质量
3. Querying Uncertain Data in Geospatial Object-relational Databases Using SQL and Fuzzy Sets [J] . R. ?ura?iová Slovak Journal of Civil Engineering . 2014,第4期

机译：使用SQL和模糊集查询地理空间对象关系数据库中的不确定数据
4. A Data Mining System Based on SQL Queries and UDFs for Relational Databases [C] . Carlos Ordonez, Carlos Garcia-Alvarado ACM international conference on information and knowledge management . 2011

机译：基于SQL查询和UDF的关系数据库数据挖掘系统
5. Examining the Relationship Between Query Performances when Using Different Data Models Within Relational Database Systems [D] . Alsup, Andrew H. 2021

机译：在关系数据库系统中使用不同的数据模型时检查查询性能之间的关系
6. Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases [O] . Ricardo Sánchez-de-Madariaga, Adolfo Muñoz, Antonio L Castro, 2018

机译：在关系型（MySQL）和NoSQL型（MongoDB和EXist）增长大小的ISO / EN 13606标准化EHR数据库中执行增加复杂性的查询
7. Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases [O] . Ricardo Sánchez-de-Madariaga, Adolfo Muñoz, Antonio L Castro, 2018

机译：在关系（MySQL）和NoSQL（MongoDB和存在）中执行复杂性越来越多的查询（MongoDB和存在）尺寸生长ISO / EN 13606标准化的EHR数据库
8. Comparison of SQL, QBE, and DFQL as Query Languages for Relational Databases. [R] . Girsang, P. 1994

机译：sQL，QBE和DFQL作为关系数据库查询语言的比较。

A Data Mining System Based on SQL Queries and UDFs for Relational Databases

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅