METHOD FOR THE FULLY MODIFIABLE FRAMEWORK DISTRIBUTION OF DATA IN A DATA WAREHOUSE TAKING ACCOUNT OF THE PRELIMINARY ETYMOLOGICAL SEPARATION OF SAID DATA

Abstract

The method for the fully modifiable framework distribution of data in a data warehouse, taking account of the preliminary etymological separation of said data, is based on the framework model of data. The totality of entity-objects that relate to a particular abstract domain is distributed into five groups in an automated way: atomic, composite, and weak entity-objects; artifacts, i.e. entity-copies whose data are conventionally placed in the warehouse; and a group of indefinite entity-objects, whose semantics are subject to further specification. The method allows the set of separation algorithms and criteria to be replenished, each of which permits a more accurate classification of a particular entity-object into the above-mentioned groups. Applying them in sequence makes it possible to speed up the process and to shrink the fifth group, the indefinite entity-objects, which have contradictory characteristics and can equally be assigned to different groups. Several algorithms are presented: an algorithm based on a dictionary of entity-objects, which is available in public networks and is constantly replenished, together with functional dependencies between the data of the entity-objects, which allow the entity-objects to be compared with each other; an algorithm for tracking repeating entity-objects in binary pairs; an algorithm for the statistical analysis of determinized or multi-valued dependencies; and algorithms of successive-approximation modifications on the framework-template of connections. This pre-separation of the set of entity-objects in the abstract domains makes it possible to use relational properties and, for example, an object-oriented model of data distribution at the same time.
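
The sequential use of replaceable separation criteria described above can be illustrated with a minimal Python sketch. It is not the patented procedure: the `Group` enum, the two criteria `by_dictionary` and `by_references`, the dictionary contents, and the "first criterion that decides wins" rule are all assumptions made for illustration. It only shows how adding a criterion to the chain can shrink the indefinite group.

```python
from enum import Enum
from typing import Callable, Optional


class Group(Enum):
    ATOMIC = "atomic"
    COMPOSITE = "composite"
    WEAK = "weak"
    ARTIFACT = "artifact"
    INDEFINITE = "indefinite"


# A criterion inspects an entity-object description and either
# returns a group or None when it cannot decide.
Criterion = Callable[[dict], Optional[Group]]


def by_dictionary(entity: dict) -> Optional[Group]:
    # Hypothetical stand-in for the public dictionary of entity-objects.
    known = {"person": Group.ATOMIC, "order_line": Group.WEAK}
    return known.get(entity["name"])


def by_references(entity: dict) -> Optional[Group]:
    # Hypothetical heuristic: no references means atomic,
    # two or more means composite, exactly one is left undecided.
    refs = entity.get("references", [])
    if len(refs) == 0:
        return Group.ATOMIC
    if len(refs) >= 2:
        return Group.COMPOSITE
    return None


def classify(entity: dict, criteria: list[Criterion]) -> Group:
    # Apply the criteria in order; an entity that no criterion
    # can place stays in the indefinite group.
    for criterion in criteria:
        group = criterion(entity)
        if group is not None:
            return group
    return Group.INDEFINITE


entities = [
    {"name": "person", "references": []},
    {"name": "enrollment", "references": ["student", "course"]},
    {"name": "note", "references": ["person"]},
]

one_step = [classify(e, [by_dictionary]) for e in entities]
two_steps = [classify(e, [by_dictionary, by_references]) for e in entities]
```

With only the dictionary criterion, two of the three sample entities remain indefinite; adding the second criterion leaves one. New criteria can be appended to the list without changing `classify`.
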
The pre-separation also makes it possible to account for artifacts, for which multiple domain masks are formed in the warehouse, each assigned an identification key corresponding to its structure. Taking the Cartesian products of the masks with one another on an "each on each" principle yields a complete set of composite entity-objects. Semantically incompatible results are then set aside from the obtained tables, for example the product of two weak entity-objects that have a common ancestor. The result is a logical and a physical data schema that are equivalent to each other, which enables the use of relational capabilities in a physically distributed data warehouse spread across different servers. The method also addresses the standardization of data warehouse schema creation.
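
The "each on each" product of masks and the removal of incompatible pairs can be sketched as follows. This is an illustrative simplification, not the method as claimed: the `Mask` record, its fields, the use of unordered pairs of distinct masks, and the reduction of "common ancestor" to a shared direct parent key are all assumptions.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Optional


@dataclass(frozen=True)
class Mask:
    key: str                      # identification key of the mask
    group: str                    # e.g. "atomic" or "weak"
    parent: Optional[str] = None  # owning entity key for a weak entity-object


def compatible(a: Mask, b: Mask) -> bool:
    # Assumed rule: two weak entity-objects that share the same parent
    # form a semantically incompatible pair.
    both_weak = a.group == "weak" and b.group == "weak"
    shared_parent = a.parent is not None and a.parent == b.parent
    return not (both_weak and shared_parent)


def composite_candidates(masks: list[Mask]) -> list[tuple[str, str]]:
    # Pair every mask with every other mask once, then drop incompatible pairs.
    return [(a.key, b.key) for a, b in combinations(masks, 2) if compatible(a, b)]


masks = [
    Mask("person", "atomic"),
    Mask("phone", "weak", parent="person"),
    Mask("address", "weak", parent="person"),
    Mask("course", "atomic"),
]

pairs = composite_candidates(masks)
```

Here four masks give six unordered pairs; the pair of `phone` and `address` is set aside because both are weak and share the parent `person`, leaving five candidate composite entity-objects.
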

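Bibliographic data

  • Publication number: US2011307440A1
  • Publication date: 2011-12-15
  • Original format: PDF
  • Applicant/assignee: BORYS EVGENIJOVICH PANCHENKO
  • Application number: US201113215250
  • Inventor: BORYS EVGENIJOVICH PANCHENKO
  • Filing date: 2011-08-23
  • Classification: G06F17/30
  • Country: US
  • Date indexed: 2022-08-21 17:31:47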
