Optimizing copious activity type classes based on classification accuracy and entropy retention

Wim Ectors; Sofie Reumers; Won Do Lee; Bruno Kochan; Davy Janssens; Tom Bellemans; Geert Wets

首页> 外文期刊>Future generation computer systems >Optimizing copious activity type classes based on classification accuracy and entropy retention

【24h】

Optimizing copious activity type classes based on classification accuracy and entropy retention

机译：基于分类准确性和熵保留优化丰富的活动类型类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Despite the advantages, big transport data are characterized by a considerable disadvantage as well. Personal and activity-travel information are often lacking, making it necessary to deduce this information with data mining techniques. However, some studies predict many unique activity type classes (ATCs), while others merge multiple activity types into larger ATCs. This action enhances the activity inference estimation, but destroys important activity information. Previous studies do not provide a strong justification for this practice. An objectively optimized set of ATCs, balancing model prediction accuracy and preserving activity information from the original data, becomes essential. Previous research developed a classification methodology in which the optimal set of ATCs was identified by analyzing all possible ATC combinations. However, this approach is practically impossible in a finite amount of time for e.g. the US National Household Travel Survey (NHTS) 2009 data set, which comprises 36 ATCs (home activity excluded), since there would be 3.82 • 10~(30) unique combinations (an exponential increase). The aim of this paper is to optimize which original ATCs should be grouped into a new class, and this for data sets for which it is impossible or impractical to simply calculate all ATC combinations. The proposed method defines an optimization parameter U (based on classification accuracy and information retention) which is maximized in an iterative local search algorithm. The optimal set of ATCs for the NHTS 2009 data set was determined. A comparison finds that this optimum is considerably better than many expert opinion activity type classification systems. Convergence was confirmed and large performance gains were found.

机译：尽管存在优势，但大型传输数据也具有相当大的缺点。个人和活动旅行信息通常缺乏，使得有必要使用数据挖掘技术推断出这些信息。然而，一些研究预测了许多唯一的活动类型类（ATC），而其他研究将多个活动类型合并到较大的ATC中。此操作提高了活动推理估计，但销毁了重要的活动信息。以前的研究对于这种做法没有提供强大的理由。从原始数据的客观优化的ATC，平衡模型预测精度和保留活动信息的客观优化的ATC，变得重要。以前的研究开发了一种分类方法，其中通过分析所有可能的ATC组合来识别最佳ATC。然而，这种方法在例如有限的时间内实际上是不可能的。美国国家家庭旅游调查（NHTS）2009年数据集，其中包括36个ATCS（不包括家庭活动），因为将有3.82•10〜（30）个独特的组合（指数增加）。本文的目的是优化哪些原始ATC应将其分组为新类，并且这对于简单地计算所有ATC组合是不可能或不切实际的数据集。所提出的方法定义优化参数U（基于分类准确性和信息保留），其在迭代本地搜索算法中最大化。确定了NHTS 2009数据集的最佳ATC集。比较发现，这种最佳优点比许多专家意见活动类型分类系统更好。确认收敛性并发现了大的性能收益。

著录项

来源
《Future generation computer systems》 |2020年第9期|338-349|共12页
作者
Wim Ectors; Sofie Reumers; Won Do Lee; Bruno Kochan; Davy Janssens; Tom Bellemans; Geert Wets;
展开▼
作者单位

UHasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

UHasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

Manchester Metropolitan University Crime and Well-being Big Data Centre All saints M15 6BH Manchester England United Kingdom;

UHasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

Uhasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

Uhasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

Uhasselt - Hasselt University Transportation Research Institute (IMOB) Agoralaan 3590 Diepenbeek Belgium;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Activity type classification; (Big) Transport data annotation; Optimal set of activity types; Local search algorithm; Classification accuracy; Entropy indices;

机译：活动类型分类;（大）运输数据注释;最佳活动类型;本地搜索算法;分类准确性;熵指数;

相似文献

外文文献
中文文献
专利

1. Developing an optimised activity type annotation method based on classification accuracy and entropy indices [J] . Ectors Wim, Reumers Sofie, Lee Won Do, Transportmetrica . 2017,第7a8期

机译：基于分类准确度和熵指标的优化活动类型标注方法
2. Biomedical classification application and parameters optimization of mixed kernel SVM based on the information entropy particle swarm optimization [J] . Mi Li, Xiaofeng Lu, Xiaodong Wang, Computer Assisted Surgery . 2016,第Suppla1期

机译：基于信息熵粒子群算法的混合核支持向量机生物医学分类应用及参数优化
3. Optimizing land cover classification accuracy for change detection, a combined pixel-based and object-based approach in a mountainous area in Mexico. [J] . Aguirre-Gutierrez J., Seijmonsbergen A. C., Duivenvoorden J. F. Applied Geography . 2012,第Null期

机译：优化土地覆盖分类精度以进行变化检测，这是墨西哥山区基于像素和基于对象的组合方法。
4. Entropy-based optimization to trade-off energy and accuracy for activity mobile sensing [C] . Taleb Sireen, Hajj Hazem, Dawy Zaher 2013 4th Annual International Conference on Energy Aware Computing Systems and Applications . 2013

机译：基于熵的优化，以权衡能量和活动移动感测的准确性
5. Land cover classification using satellite -sensed imagery and its texture values: An accuracy assessment based on the Florida Land Use and Cover Classification System. [D] . Shrestha, Tilak Bahadur. 1999

机译：使用卫星感应图像及其纹理值进行土地覆盖分类：基于佛罗里达州土地利用和覆盖分类系统的准确性评估。
6. A Rolling Bearing Fault Classification Scheme Based on k-Optimized Adaptive Local Iterative Filtering and Improved Multiscale Permutation Entropy [O] . Yi Zhang, Yong Lv, Mao Ge 2021

机译：一种基于K优化自适应局部迭代过滤和改进的多尺度置换熵的滚动轴承故障分类方案
7. Biomedical classification application and parameters optimization of mixed kernel SVM based on the information entropy particle swarm optimization [O] . Mi Li, Xiaofeng Lu, Xiaodong Wang, 2016

机译：基于信息熵粒子群优化的生物医学分类应用和混合核SVM的参数优化

Optimizing copious activity type classes based on classification accuracy and entropy retention

摘要

著录项

相似文献

相关主题

期刊订阅