【24h】

Parametric classification over multiple samples

机译:多个样本的参数分类

获取原文
获取原文并翻译 | 示例

摘要

This pattern was originally designed to classify sequences of events in log files by error-proneness. Sequences of events trace application use in real contexts. As such, identifying error-prone sequences helps understand and predict application use. The classification problem we describe is typical in supervised machine learning, but the composite pattern we propose investigates it with several techniques to control for data brittleness. Data pre-processing, feature selection, parametric classification, and cross-validation are the major instruments that enable a good degree of control over this classification problem. In particular, the pattern includes a solution for typical problems that occurs when data comes from several samples of different populations and with different degree of sparcity.
机译:此模式最初旨在通过易错性对日志文件中的事件序列进行分类。事件序列跟踪应用程序在实际上下文中的使用。这样,识别容易出错的序列有助于理解和预测应用程序的使用。我们描述的分类问题是有监督机器学习中的典型问题,但是我们提出的复合模式使用几种控制数据脆性的技术对其进行了研究。数据预处理,特征选择,参数分类和交叉验证是可以很好地控制此分类问题的主要工具。特别是,该模式包括针对典型问题的解决方案,当数据来自不同总体且稀疏程度不同的多个样本时,就会发生这种问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号