首页> 外国专利> SYSTEMS AND METHODS FOR GENERALIZED STRUCTURED DATA DISCOVERY UTILIZING CONTEXTUAL METADATA DISAMBIGUATION VIA MACHINE LEARNING TECHNIQUES

SYSTEMS AND METHODS FOR GENERALIZED STRUCTURED DATA DISCOVERY UTILIZING CONTEXTUAL METADATA DISAMBIGUATION VIA MACHINE LEARNING TECHNIQUES

机译:通过机器学习技术利用上下文元数据消除歧义的广义结构化数据发现的系统和方法

摘要

A method for generalized structured data discovery may include (1) receiving physical application metadata from data sources for an attribute, a database object, or a database; (2) receiving reference data comprising a plurality of tokens and their associated abbreviations/acronyms; (3) parsing the physical application metadata into a application tokens comprising known and unknown application tokens; (4) identifying unknown application tokens by comparing the parsed application tokens to a corpus; (5) performing probabilistic parsing on the unknown application tokens using the reference data; (6) performing bi-directional encoding to expand the polysemous tokens to relevant expressions using the reference data; (7) applying language tokens to the relevant expressions in the expanded polysemous tokens to disambiguate the relevant expressions; and (8) outputting a mapping of the physical application metadata to enhanced physical application metadata, wherein the enhanced physical application metadata comprises an expression for the physical application metadata in a supported language.
机译:广义结构化数据发现的方法可以包括(1)从属性,数据库对象或数据库的数据源接收物理应用元数据; (2)接收包括多个令牌及其相关的缩写/首字母缩略词的参考数据; (3)将物理应用程序元数据解析为包括已知和未知应用令牌的应用令牌; (4)通过将解析的应用程序令牌与语料库进行比较来识别未知的应用程序令牌; (5)使用参考数据对未知应用令牌执行概率解析; (6)执行双向编码以使用参考数据将多态标记扩展到相关表达式; (7)将语言代币应用于扩建的多园标记中的相关表达,以消除相关表达; (8)输出物理应用元数据的映射到增强的物理应用元数据,其中增强的物理应用元数据包括以支持的语言的物理应用元数据的表达式。

著录项

  • 公开/公告号US2022067294A1

    专利类型

  • 公开/公告日2022-03-03

    原文格式PDF

  • 申请/专利权人 JPMORGAN CHASE BANK N.A.;

    申请/专利号US202017010023

  • 发明设计人 SANTOSH CHIKOTI;JEFFREY KESSLER;

    申请日2020-09-02

  • 分类号G06F40/30;G06F40/205;G06F40/253;G06N20;G06F16/2457;

  • 国家 US

  • 入库时间 2022-08-24 23:42:55

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号