Finding Similar Identities among Objects from Multiple Web Sources

机译：在多个Web源中的对象之间寻找相似的身份

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

When integrating data from multiple Web sources, objects can exist in different formats and structures, making it difficult to identify those that can be matched together. In this paper, we propose an identification approach to finding similar identities among objects from multiple Web sources. In this approach, object identification works like the relational join operation where a similarity function takes the place of the equality condition. This similarity function is based on information retrieval techniques. Our approach differs from others in the literature since it can be used to identify objects more complexly structured (e.g., XML documents) and not only objects with a flat structure such as relations. The effectiveness of our approach is demonstrated by experimental results with real Web data sources from different domains, that reach precision levels above 75%.

机译：当集成来自多个Web来源的数据时，对象可以以不同的格式和结构存在，从而难以识别可以匹配的对象。在本文中，我们提出了一种识别方法，可以从多个Web来源中找到对象之间的相似身份。在这种方法中，对象标识的工作方式类似于关系联接操作，其中相似性函数代替了相等条件。这种相似性功能基于信息检索技术。我们的方法与文献中的其他方法不同，因为它可以用于识别结构更复杂的对象（例如XML文档），而不仅用于识别关系等扁平结构的对象。通过使用来自不同领域的真实Web数据源的实验结果证明了我们方法的有效性，这些数据源的准确度达到75％以上。

著录项

来源
《ACM(Association for Computing Machinery) International Workshop on Web Information and Data Management(WIDM 2003); 20031107-20031108; New Orleans,LA; US》|2003年|P.90-93|共4页
会议地点 New Orleans LA(US);New Orleans LA(US);New Orleans LA(US);New Orleans LA(US)
作者
Joyce C. P. Carvalho; Altigran S. da Silva;
展开▼
作者单位

Department of Computer Science Federal University of Minas Gerais 31270-901 Belo Horizonte -MG -Brazil;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机网络;
关键词
web data integration; similarity;

机译：网络数据集成；相似性;

相似文献

外文文献
中文文献
专利

1. African American Ethnic and Class-Based Identities on the World Wide Web: Moderating the Effects of Self-Perceived Information Seeking/Finding and Web Self-Efficacy [J] . Jennifer R.Warren, rnMichael L. Hecht, rnEura Jung, Communication research . 2010,第5期

机译：万维网上的非裔美国人种族和基于阶级的身份：调节自我感知的信息寻找/查找和网络自我效能的影响
2. Cortical Circuit for Binding Object Identity and Location During Multiple-Object Tracking [J] . Nummenmaa Lauri, Oksama Lauri, Glerean Erico, Cerebral cortex . 2017,第1期

机译：用于绑定对象标识的皮质电路和多对象跟踪期间的位置
3. Shadowing and multiple rings in the protoplanetary disk of HD 139614 ★ ★★ [J] . G. A. Muro-Arena, M. Benisty, C. Ginski, Astronomy and astrophysics . 2020,第7期

机译：HD 139614的原始盘中的阴影和多个环★ ★★ <相关 - 对象对象类型=“tablecds”source-id =“http://cdsarc.u-strasbg.fr/viz-bin/cat/j/a anyla+a/635/a121”source-id- type =“URL”/>
4. Finding Similar Identities among Objects from Multiple Web Sources [C] . Joyce C. P. Carvalho, Altigran S. da Silva Association for Computing Machinery International Workshop on Web Information and Data Management . 2003

机译：从多个Web源中查找类似的身份
5. Comparative Mining of Multiple Web Data Source Contents with Object Oriented Model. [D] . Alahmad, Yanal. 2013

机译：使用面向对象模型比较挖掘多个Web数据源内容。
6. Cortical Circuit for Binding Object Identity and Location During Multiple-Object Tracking [O] . Lauri Nummenmaa, Lauri Oksama, Erico Glerean, -1

机译：在多对象跟踪过程中用于绑定对象标识和位置的皮质电路
7. Comparative Mining of Multiple Web Data Source Contents with Object Oriented Model [O] . Alahmad Yanal 2013

机译：面向对象模型的多个Web数据源内容的比较挖掘
8. Web-Enhanced Instruction and Learning: Findings of a Short- and Long-Term Impact Study and Teacher Use of NASA Web Resources [R] . McCarthy, Marianne C., Grabowski, Barbara L., Koszalka, Tiffany 2003

机译：网络增强的教学和学习：Nasa网络资源的短期和长期影响研究和教师使用的结果

Finding Similar Identities among Objects from Multiple Web Sources

摘要

著录项

相似文献

相关主题

期刊订阅