APPARATUS, METHOD AND COMPUTER-ACCESSIBLE MEDIUM FOR EXPLAINING CLASSIFICATIONS OF DOCUMENTS

Abstract

The disclosure relates to the classification of collections of items such as words, called “document classification,” and more specifically to explaining the classification of a document, such as a web page or website. It can include an exemplary procedure, system and/or computer-accessible medium for finding explanations, as well as a framework for assessing the procedure's performance. An explanation is defined as a set of words (or, more generally, terms) such that removing the words in this set from the document changes the predicted class away from the class of interest. One exemplary application is the classification of web pages as containing adult content, e.g., to allow advertising on safe web pages only. The explanations can be concise and document-specific, and can provide insight into the reasons for the classification decisions, into the workings of the classification models, and into the business application itself. Other exemplary aspects describe how explaining documents' classifications can assist in improving data quality and model performance.
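
The definition of an explanation in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the toy linear model, the `WEIGHTS` and `BIAS` values, and the helper names `predict` and `find_explanation`. The search simply tries word sets of increasing size and returns the first one whose removal moves the document out of the class of interest.

```python
from itertools import combinations

# Hypothetical per-word weights for a toy linear classifier (illustrative only).
WEIGHTS = {"explicit": 2.0, "adult": 1.5, "video": 0.25, "welcome": -0.5, "news": -1.0}
BIAS = -1.0


def predict(words):
    """Return True when the toy model assigns the class of interest."""
    score = BIAS + sum(WEIGHTS.get(w, 0.0) for w in set(words))
    return score > 0


def find_explanation(words, max_size=3):
    """Return a smallest set of words whose removal flips the prediction.

    Returns None if the document is not in the class of interest, or if no
    set of at most max_size words changes the predicted class.
    """
    vocab = sorted(set(words))
    if not predict(vocab):
        return None
    # Try smaller candidate sets first so the explanation stays concise.
    for size in range(1, max_size + 1):
        for candidate in combinations(vocab, size):
            remaining = [w for w in vocab if w not in candidate]
            if not predict(remaining):
                return set(candidate)
    return None


document = ["welcome", "explicit", "adult", "video"]
explanation = find_explanation(document)
```

For this toy document no single word is enough to change the prediction, so the sketch returns a two-word set. Exhaustive search over subsets grows quickly with document length, so a practical procedure would likely need a heuristic ordering of candidate words; that choice is outside what the abstract states.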

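Bibliographic data

  • Publication number: US2014229164A1

  • Patent type:

  • Publication date: 2014-08-14

  • Original format: PDF

  • Applicant/Patentee: DAVID MARTENS; FOSTER PROVOST

  • Application number: US201214001242

  • Inventors: DAVID MARTENS; FOSTER PROVOST

  • Filing date: 2012-02-23

  • Classification: G06F17/28

  • Country: US

  • Database entry time: 2022-08-21 16:10:00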
