Clustering-Based Schema Matching of Web Data for Constructing Digital Library

机译：基于聚类的Web数据模式匹配构建数字图书馆

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The abundant information on the web attracts many researches on reusing the valuable web data in other information applications, for example, digital libraries. Web information published by various contributors in different ways, schema matching is a basic problem for the heterogeneous data sources integration. Web information integration arises new challenges from the following ways: web data are short of intact schema definition; and the schema matching between web data can not be simplified as 1-1 mapping problem. In this paper we propose an algorithm, COSM, to automatic the web data schema matching process. The matching process is transformed into a clustering problem: the data elements clustered into one cluster are viewed as mapping ones. COSM is mainly instance-level matching approach, also combined with a partial name matcher in calculating the elements distance metrics. A pretreatment for data is carried out to give rational distance metrics between elements before clustering step. The experiment of algorithm testing and application (applied in the Chinese folk music digital library construction) proves the algorithm's efficiency.

机译：网络上的大量信息吸引了许多有关在其他信息应用程序（例如数字图书馆）中重用有价值的网络数据的研究。 Web信息由各种贡献者以不同方式发布，模式匹配是异构数据源集成的一个基本问题。 Web信息集成从以下方面提出了新的挑战：Web数据缺少完整的模式定义； Web数据之间的模式匹配不能简化为1-1映射问题。在本文中，我们提出了一种COSM算法，可以自动执行Web数据模式匹配过程。匹配过程转化为一个聚类问题：聚类为一个聚类的数据元素被视为映射元素。 COSM主要是实例级匹配方法，在计算元素距离度量时，还与部分名称匹配器结合使用。在聚类步骤之前，对数据进行了预处理以给出元素之间的合理距离度量。算法测试与应用实验（应用于中国民间音乐数字图书馆建设）证明了该算法的有效性。

著录项

来源
《International Conference on Computational Scinece and Its Applications(ICCSA 2005) pt.2; 20050509-12; Singapore(SG)》|2005年|P.1086-1095|共10页
会议地点 Singapore(SG)
作者
Hui Song; Fanyuan Ma; Chen Wang;
展开▼
作者单位

Department of Computer Information Technology, Donghua University, 200051 Shanghai, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类理论、方法;
关键词

相似文献

外文文献
中文文献
专利

1. Argumentation-based schema matching for multiple digital libraries [J] . Tho Thanh Quan, Xuan H. Luong, Thanh C. Nguyen, Online Information Review . 2015,第1期

机译：多个数字图书馆的基于参数的模式匹配
2. OCLCs Linked Data Initiative:Using Schema.org to Make Library Data Relevant on the Web [J] . TED FONS, JEFF PENKA, RICHARD WALLIS Information Standards Quarterly . 2012,第2a3期

机译：OCLC链接数据倡议：使用Schema.org 使Web上的图书馆数据具有相关性
3. Metadata Schema to Facilitate Linked Data for 3D Digital Models of Cultural Heritage Collections: A University of South Florida Libraries Case Study [J] . Xiying Mi, Bonita M. Pollock Cataloging & classification quarterly . 2018,第1a4期

机译：元数据架构可促进文化遗产收藏的3D数字模型的链接数据：南佛罗里达大学图书馆案例研究
4. Clustering-Based Schema Matching of Web Data for Constructing Digital Library [C] . Hui Song, Fanyuan Ma, Chen Wang International Conference on Computational Scinece and Its Applications . 2005

机译：基于聚类的基于Web数据构造数字库的模式匹配
5. A Web-accessible relational database for intact rock properties and an XML data format for intact rock properties with schema. [D] . Turichshev, Alexandr. 2002

机译：一个可访问Web的关系数据库，用于存储完整的岩石属性，以及XML数据格式，用于具有模式的完整的岩石属性。
6. Clever generation of rich SPARQL queries from annotated relational schema: application to Semantic Web Service creation for biological databases [O] . Julien Wollbrett, Pierre Larmande, Frédéric de Lamotte, 2013

机译：从带注释的关系模式中聪明地生成丰富的SPARQL查询：应用于生物学数据库的语义Web服务创建
7. On the Effectiveness of Automatic Schema Matching Over Heterogeneous Digital Libraries [O] . Renda Maria Elena, Straccia Umberto 2005

机译：异构数字图书馆自动模式匹配的有效性研究

Clustering-Based Schema Matching of Web Data for Constructing Digital Library

摘要

著录项

相似文献

相关主题

期刊订阅