首页> 外国专利> Data analysis computer system and method for organizing, presenting, and optimizing predictive modeling

Data analysis computer system and method for organizing, presenting, and optimizing predictive modeling

机译：用于组织，呈现和优化预测建模的数据分析计算机系统和方法

页面导航

摘要
著录项
相似文献

摘要

Predictive modeling is an important class of data analytics with applications in numerous fields. Once a predictive model is built, validated, and applied on a set of objects, by a data analytics system (or even by manual modeling), consumers of the model information need assistance to navigate through the results. This is because both regression and classification models that output continuous values (eg, probability of belonging to a class) are often used to rank objects and then a thresholding of the ranked scores needs to be used to separate objects into a “positive” and a “negative” class. The choice of threshold greatly affects the true positive, false positive, true negative, and false negative results of the model's application. An ideal data analytics system should allow the user to understand the tradeoffs of different threshold values for different thresholds. The user interface should convey this information in an intuitive manner and provide the ability to vary the threshold interactively while simultaneously presenting the effects of thresholding on predictivity. This is precisely the function of the present invention. In addition to manual thresholding, the invention also allows for the thresholding to be performed by fully automated means (via standard statistical optimization methods) once a user has identified the desired balance of false positives and false negatives (or other predictivity metrics of interest). The invention can be applied to any application field of predictive modeling.

机译：预测建模是一类重要的数据分析，其应用领域众多。一旦通过数据分析系统（或什至通过手动建模）建立，验证并将预测模型应用于一组对象，模型信息的使用者就需要帮助来浏览结果。这是因为输出连续值（例如，属于某个类别的概率）的回归模型和分类模型通常都用于对对象进行排名，然后需要使用排名分数的阈值将对象分为“正”和“正”。 “负”类。阈值的选择会极大地影响模型应用程序的正确，错误肯定，正确否定和错误否定结果。理想的数据分析系统应允许用户了解不同阈值对于不同阈值的权衡。用户界面应以直观的方式传达此信息，并提供交互更改阈值的能力，同时呈现阈值对可预测性的影响。这恰好是本发明的功能。除了手动阈值化，一旦用户已经识别出假阳性和假阴性（或其他感兴趣的预测性度量）的期望平衡，本发明还允许通过全自动方式（通过标准统计优化方法）执行阈值化。本发明可以应用于预测建模的任何应用领域。

著录项

公开/公告号US9772741B2

专利类型
公开/公告日2017-09-26

原文格式PDF
申请/专利权人 KONSTANTINOS (CONSTANTIN) F. ALIFERIS;YIN APHINYANAPHONGS;LAWRENCE FU;ALEXANDER STATNIKOV;
展开▼

申请/专利号US201414216585
发明设计人 LAWRENCE FU;KONSTANTINOS (CONSTANTIN) F. ALIFERIS;YIN APHINYANAPHONGS;ALEXANDER STATNIKOV;
展开▼

申请日2014-03-17
分类号G06F17/00;G06N5/02;G06F3/0481;G06N99/00;G06Q50/18;
国家 US
入库时间 2022-08-21 13:46:01

相似文献

专利
外文文献
中文文献