Preparing Heterogeneous XML for Full-Text Search

MIRO LEHTONEN

首页> 外文期刊>ACM Transactions on Information Systems >Preparing Heterogeneous XML for Full-Text Search

【24h】

Preparing Heterogeneous XML for Full-Text Search

机译：准备用于全文搜索的异构XML

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

XML retrieval is facing new challenges when applied to heterogeneous XML documents, where next to nothing about the document structure can be taken for granted. We have developed solutions where some of the heterogeneity issues are addressed. Our fragment selection algorithm selectively divides a heterogeneous document collection into equi-sized fragments with full-text content. If the content is considered too data-oriented, it is not accepted. The algorithm needs no information about element names. In addition, three techniques for fragment expansion are presented, all of which yield a 13-17% average improvement in average precision. These techniques and algorithms are among the first steps in developing document-type-independent indexing methods for the full text in heterogeneous XML collections.

机译：当将XML检索应用于异构XML文档时，面临着新的挑战，其中关于文档结构的几乎所有内容都可以认为是理所当然的。我们已经开发出解决某些异质性问题的解决方案。我们的片段选择算法选择性地将异构文档集合分为具有全文内容的大小相等的片段。如果认为内容过于面向数据，则不接受。该算法不需要有关元素名称的信息。此外，提出了三种用于片段扩展的技术，所有这些技术平均平均精度提高了13-17％。这些技术和算法是为异构XML集合中的全文本开发与文档类型无关的索引方法的第一步。

著录项

来源
《ACM Transactions on Information Systems》 |2006年第4期|p.455-474|共20页
作者
MIRO LEHTONEN;
展开▼
作者单位

Department of Computer Science, University of Helsinki, P.O. Box 68, FIN-00014 University of Helsinki, Finland;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
XML retrieval; indexing; heterogeneous documents;

机译：XML检索;建立索引;异构文档;

相似文献

外文文献
中文文献
专利

1. Enhancing XML search with XQuery 1.0 and XPath 2.0 Full-Text [J] . IBM Systems Journal . 2006,第2期

机译：使用XQuery 1.0和XPath 2.0全文增强XML搜索
2. Enhancing XML search with XQuery 1.0 and XPath 2.0 Full-Text [J] . P. Case IBM Systems Journal . 2006,第2期

机译：使用XQuery 1.0和XPath 2.0全文增强XML搜索
3. Construction and usage of full-text search engine system -How to create your web site with full-text search engine system- [J] . Harada Yoichi 情報管理 . 2001,第10期

机译：全文搜索引擎系统的构建和使用-如何使用全文搜索引擎系统创建您的网站-
4. TopX 2.0 at the INEX 2008 Efficiency Track A (Very) Fast Object-Store for Top-k-Style XML Full-Text Search [C] . Martin Theobald, Mohammed AbuJarour, Ralf Schenkel International Workshop of the Initiative for the Evaluation of XML Retrieval . 2009

机译：Topx 2.0在Inex 2008效率跟踪A（非常）Fast Object-Store for Top-K样式XML全文搜索
5. Democratic community-based search with XML full-text queries [D] . Curtmola, Emiran 2009

机译：使用XML全文查询的基于民主社区的搜索
6. Improving a full-text search engine: the importance of negation detection and family history context to identify cases in a biomedical data warehouse [O] . Nicolas Garcelon, Antoine Neuraz, Vincent Benoit, 2017

机译：完善全文搜索引擎：否定检测和家族历史背景的重要性以识别生物医学数据仓库中的案例
7. A texquery-based xml full-text search engine [O] . Chavdar Botev, Sihem Amer-yahia, Jayavel Shanmugasundaram 2015

机译：基于texquery的xml全文搜索引擎
8. Intelligent Search of Full-Text Databases [R] . Gauch, S., Smith, J. B. 1987

机译：智能搜索全文数据库

Preparing Heterogeneous XML for Full-Text Search

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅