首页> 外文会议>IEEE Conference on Visual Analytics Science anf Technology >FeatureInsight: Visual support for error-driven feature ideation in text classification
【24h】

FeatureInsight: Visual support for error-driven feature ideation in text classification

机译:特性istight:在文本分类中的错误驱动功能iDeation视觉支持

获取原文

摘要

Machine learning requires an effective combination of data, features, and algorithms. While many tools exist for working with machine learning data and algorithms, support for thinking of new features, or feature ideation, remains poor. In this paper, we investigate two general approaches to support feature ideation: visual summaries and sets of errors. We present FeatureInsight, an interactive visual analytics tool for building new dictionary features (semantically related groups of words) for text classification problems. FeatureInsight supports an error-driven feature ideation process and provides interactive visual summaries of sets of misclassified documents. We conducted a controlled experiment evaluating both visual summaries and sets of errors in FeatureInsight. Our results show that visual summaries significantly improve feature ideation, especially in combination with sets of errors. Users preferred visual summaries over viewing raw data, and only preferred examining sets when visual summaries were provided. We discuss extensions of both approaches to data types other than text, and point to areas for future research.
机译:机器学习需要有效的数据,功能和算法组合。虽然存在许多用于与机器学习数据和算法一起使用的工具,但对思考新功能或特征意思,而且仍然存在差距。在本文中,我们调查了两个支持特征意思的一般方法:视觉摘要和错误集。我们提出了ComponentIsIght,一个用于构建新的字典特征的交互式视觉分析工具(语义相关词组),用于文本分类问题。 techingInsight支持错误驱动的功能iDeation进程,并提供错误分类文档集的交互式视觉摘要。我们进行了一个受控实验,评估了特性中的视觉摘要和一组错误。我们的研究结果表明,视觉摘要显着提高了特征的特征,特别是与误差组合。用户首选视觉摘要通过查看原始数据,并且在提供视觉摘要时才仅优选的检查集。我们讨论两种方法的扩展到文本以外的数据类型,并指向未来研究的区域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号