首页> 外国专利> Data analysis computer system and method for organizing, presenting, and optimizing predictive modeling

Data analysis computer system and method for organizing, presenting, and optimizing predictive modeling

机译:用于组织,呈现和优化预测建模的数据分析计算机系统和方法

摘要

Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds. The user interface should convey this information in an intuitive manner and provide the ability to vary the threshold interactively while simultaneously presenting the effects of thresholding on predictivity. This is precisely the function of the present invention. In addition to manual thresholding, the invention also allows for the thresholding to be performed by fully automated means (via standard statistical optimization methods) once a user has identified the desired balance of false positives and false negatives (or other predictivity metrics of interest). The invention can be applied to any application field of predictive modeling.
机译:预测建模是一类重要的数据分析,其应用领域众多。一旦通过数据分析系统(或什至通过手动建模)建立,验证并将预测模型应用于一组对象,模型信息的使用者就需要帮助来浏览结果。这是因为输出连续值(例如,属于某个类别的概率)的回归模型和分类模型通常都用于对对象进行排名,然后需要使用排名分数的阈值将对象分为“正”和“正”。 “负”类。阈值的选择会极大地影响模型应用程序的正确,错误肯定,正确否定和错误否定结果。理想的数据分析系统应允许用户了解不同阈值对于不同阈值的权衡。用户界面应以直观的方式传达此信息,并提供交互更改阈值的能力,同时呈现阈值对可预测性的影响。这恰好是本发明的功能。除了手动阈值化,一旦用户已经识别出假阳性和假阴性(或其他感兴趣的预测性度量)的期望平衡,本发明还允许通过全自动方式(通过标准统计优化方法)执行阈值化。本发明可以应用于预测建模的任何应用领域。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号