Multi-pattern generation framework for logical analysis of data

Chou Chun-An; Bonates Tiberius O.; Lee Chungmok; Chaovalitwongse Wanpracha Art


Abstract

Logical analysis of data (LAD) is a rule-based data mining algorithm that uses combinatorial optimization and Boolean logic for binary classification. The goal is to construct a classification model consisting of logical patterns (rules) that capture structured information from observations. Among the four steps of the LAD framework (binarization, feature selection, pattern generation, and model construction), pattern generation has been considered the most important. Combinatorial enumeration approaches that generate all possible patterns have been the most studied in the literature; however, the computational complexity of these approaches grows exponentially with data (feature) size. To overcome this problem, recent studies proposed column generation-based approaches to improve the efficacy of building a LAD model with a maximum-margin objective, but solving the subproblems efficiently to generate patterns remained difficult. In this study, a new column generation framework is proposed, in which a new mixed-integer linear programming approach generates multiple patterns having maximum coverage in the subproblems at each iteration. In addition to the maximum-margin objective, we propose an alternative objective (minimum-pattern) that solves the LAD problem as a minimum set covering problem. The proposed approaches are evaluated on datasets from the University of California Irvine Machine Learning Repository. In the computational experiments, their performance is comparable to that of previous LAD and other well-known classification algorithms.
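To make the notions of a pattern, its coverage, and the minimum-pattern (set covering) view concrete, here is a minimal Python sketch. It is not the paper's method: the abstract describes mixed-integer linear programming subproblems inside column generation, whereas this sketch swaps in brute-force enumeration of short conjunctions followed by a greedy set-cover heuristic. The toy data and the names `covers`, `pure_positive_patterns`, and `greedy_cover` are assumptions made for illustration only.

```python
from itertools import combinations, product

# Toy binarized observations (hypothetical data, not from the paper).
positives = [(1, 0, 1), (1, 1, 1), (1, 0, 0)]
negatives = [(0, 0, 1), (0, 1, 0)]


def covers(pattern, obs):
    """A pattern maps feature index -> required value; it covers obs if every literal holds."""
    return all(obs[j] == v for j, v in pattern.items())


def pure_positive_patterns(positives, negatives, max_degree=2):
    """Enumerate conjunctions of at most max_degree literals that cover
    at least one positive observation and no negative observation."""
    n_features = len(positives[0])
    found = []
    for degree in range(1, max_degree + 1):
        for feats in combinations(range(n_features), degree):
            for values in product((0, 1), repeat=degree):
                pattern = dict(zip(feats, values))
                if any(covers(pattern, x) for x in negatives):
                    continue  # not a pure positive pattern
                covered = frozenset(
                    i for i, x in enumerate(positives) if covers(pattern, x)
                )
                if covered:
                    found.append((pattern, covered))
    return found


def greedy_cover(candidates, n_positives):
    """Greedy heuristic for the set covering view: repeatedly pick the pattern
    that covers the most still-uncovered positive observations."""
    uncovered = set(range(n_positives))
    chosen = []
    while uncovered:
        pattern, covered = max(candidates, key=lambda pc: len(pc[1] & uncovered))
        if not covered & uncovered:
            break  # remaining positives cannot be covered by any candidate
        chosen.append(pattern)
        uncovered -= covered
    return chosen, uncovered


candidates = pure_positive_patterns(positives, negatives)
chosen, uncovered = greedy_cover(candidates, len(positives))
print(chosen, uncovered)
```

The enumeration step is exactly the part whose cost grows exponentially with the number of features, which is the motivation the abstract gives for generating patterns through optimization subproblems instead.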


Bibliographic details

Source
Annals of Operations Research | 2017, Issue 2 | pp. 329-349 | 21 pages
Authors
Chou Chun-An; Bonates Tiberius O.; Lee Chungmok; Chaovalitwongse Wanpracha Art
Author affiliations

SUNY Binghamton, Dept Syst Sci & Ind Engn, Vestal, NY 13850 USA;

Univ Fed Ceara, Dept Stat & Appl Math, Fortaleza, CE, Brazil;

Hankuk Univ Foreign Studies, Dept Ind & Management Engn, Yongin 449791, Gyeonggi Do, South Korea;

Univ Washington, Dept Ind & Syst Engn, Seattle, WA 98195 USA|Univ Washington, Dept Radiol, Seattle, WA 98195 USA;

Original format: PDF
Language: English
Keywords
Logical analysis of data; Combinatorial optimization; Column generation; Pattern mining; Classification;


