An approximation method for extracting typical classes from semistructured data

Abstract

We consider a class extraction problem over semistructured data. A class C is extracted by grouping objects that have similar (not necessarily identical) sets of properties into C, where the set of properties of C is the union of the property sets of the objects in C. Let C be an extracted class and o be an object in C. If C has a property P but o has no value for P, then P is null within o. An extracted class C is called typical if the number of nulls in C is small relative to the number of objects in C and the number of properties of C. We present the following results. First, we prove that the problem of deciding whether a typical class can be extracted from given semistructured data is NP-complete. Second, we present an approximation algorithm for extracting typical classes from given semistructured data. Finally, we briefly discuss a sufficient condition for the approximation algorithm to run efficiently.
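
The definitions of "null" and "typical" in the abstract can be illustrated with a small sketch. The abstract does not give the exact typicality criterion, so the ratio of nulls to (objects × properties) and the `max_ratio` threshold below are assumptions made for illustration only, as are all function names and the dictionary representation of objects. This is not the authors' approximation algorithm; it only checks a class that has already been formed.

```python
from typing import Dict, List, Set

# Hypothetical representation: an object maps property names to values.
SemiObject = Dict[str, object]


def class_properties(objects: List[SemiObject]) -> Set[str]:
    """Properties of a class: the union of the properties of its objects."""
    props: Set[str] = set()
    for obj in objects:
        props.update(obj)  # adds the object's property names
    return props


def count_nulls(objects: List[SemiObject]) -> int:
    """Count (object, property) pairs where the class has the property
    but the object has no value for it."""
    props = class_properties(objects)
    return sum(len(props - set(obj)) for obj in objects)


def null_ratio(objects: List[SemiObject]) -> float:
    """Nulls divided by (number of objects * number of class properties).
    This normalisation is an assumption, not taken from the abstract."""
    cells = len(objects) * len(class_properties(objects))
    return count_nulls(objects) / cells if cells else 0.0


def is_typical(objects: List[SemiObject], max_ratio: float = 0.25) -> bool:
    """Assumed criterion: the class is typical if its null ratio is at most
    max_ratio (a hypothetical threshold)."""
    return null_ratio(objects) <= max_ratio


# Example class of three objects with similar but not identical properties.
people = [
    {"name": "a", "email": "x"},                # phone is null
    {"name": "b", "phone": "1"},                # email is null
    {"name": "c", "email": "y", "phone": "2"},  # no nulls
]
```

In this example the class has three properties and two nulls, so the assumed ratio is 2/9; whether that counts as "small" depends entirely on the chosen threshold.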

Bibliographic details

Source: International Symposium on Database Applications in Non-Traditional Environments, 2000, 4 pages
Authors: Suzuki N.; Sato Y.; Institute of Electrical and Electronics Engineers
Original format: PDF
Chinese Library Classification: Automation technology; computer technology

Similar literature

1. Extracting Typical Classes and a Database Scheme from Semistructured Data [J]. Nobutaka Suzuki, Yoichirou Sato, Michiyoshi Hayase. IEICE Transactions on Information and Systems, 2001, No. 1.
2. Linear Approximation Methods and the Best Approximations of the Poisson Integrals of Functions from the Classes H_(ωp) in the Metrics of the Spaces L_p [J]. A. S. Serdyuk, I. V. Sokolenko. Ukrainian Mathematical Journal, 2010, No. 7.
3. Extracting Local Schema from Semistructured Data Based on Graph-Oriented Semantic Model [J]. Wang Tengjiao, Tang Shiwei, Yang Dongqing. Journal of Computer Science & Technology, 2001, No. 6.
4. An approximation method for extracting typical classes from semistructured data [C]. Suzuki, N., Sato, . 2000.
5. Methods for Extracting Data from the Internet [D]. Willers, Joel. 2017.
6. Performance of a Natural Language Processing (NLP) Tool to Extract Pulmonary Function Test (PFT) Reports from Structured and Semistructured Veteran Affairs (VA) Data [O]. Brian C. Sauer, Barbara E. Jones, Gary Globe.
7. Extracting partition statistics from semistructured data [O]. Wilson, John N., Gourlay, Richard, Japp, Robert. 2006.
8. Approximation and Data Fitting Methods: Part 1, Introduction to Numerical Approximation Methods [R]. Fritsch, F. N. 1986.
