首页> 外国专利> IDENTIFYING PRODUCT REFERENCES IN USER-GENERATED CONTENT

IDENTIFYING PRODUCT REFERENCES IN USER-GENERATED CONTENT

机译:标识用户生成的内容中的产品参考

摘要

Systems and methods are disclosed herein for extracting products referenced in a document. A document is analyzed to identify a product type that is referenced in the document. Attributes are extracted from the document. A set of candidate products are identified corresponding to the extracted attributes. A score is calculated for the candidate products and the products are further selected or filtered based on the score, whitelist rules, and blacklist rules in order to identify one or more inferred products referenced by the document. The whitelist and blacklist rules may take as inputs a domain, a user identifier, and keywords included in the document. A set of sufficient attributes may be identified for each product type. Selection of a candidate product may be based at least in part on the document including all of the attributes in the set of sufficient attributes.
机译:本文公开了用于提取文档中引用的产品的系统和方法。分析文档以标识文档中引用的产品类型。从文档中提取属性。识别与所提取的属性相对应的一组候选产品。计算候选产品的得分,并根据得分,白名单规则和黑名单规则进一步选择或过滤产品,以识别文档引用的一个或多个推断产品。白名单和黑名单规则可以将文档中包含的域,用户标识符和关键字作为输入。可以为每种产品类型标识一组足够的属性。候选产品的选择可以至少部分地基于包括足够属性集合中的所有属性的文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号