【24h】

Identifying Hidden Contexts in Classification

机译:在分类中识别隐藏的上下文

获取原文

摘要

In this study we investigate how to identify hidden contexts from the data in classification tasks. Contexts are artifacts in the data, which do not predict the class label directly. For instance, in speech recognition task speakers might have different accents, which do not directly discriminate between the spoken words. Identifying hidden contexts is considered as data preprocessing task, which can help to build more accurate classifiers, tailored for particular contexts and give an insight into the data structure. We present three techniques to identify hidden contexts, which hide class label information from the input data and partition it using clustering techniques. We form a collection of performance measures to ensure that the resulting contexts are valid. We evaluate the performance of the proposed techniques on thirty real datasets. We present a case study illustrating how the identified contexts can be used to build specialized more accurate classifiers.
机译:在本研究中,我们调查如何从分类任务中的数据中识别隐藏的上下文。上下文是数据中的文物,它不会直接预测类标签。例如,在语音识别任务中,扬声器可能具有不同的可口音,不会直接区分口语。识别隐藏的上下文被视为数据预处理任务,可以帮助构建更准确的分类器,为特定的上下文量身定制,并深入了解数据结构。我们提出了三种技术来标识隐藏的上下文,它将类标签信息从输入数据中隐藏,使用聚类技术分区。我们构成了一系列性能措施,以确保产生的上下文有效。我们评估了三十个真实数据集的提出技术的性能。我们展示了一个案例研究,说明了所识别的上下文如何用于构建专门的更准确的分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号