首页> 外文期刊>Computer Science and Information Technology >Keyword Based Searching According to the Movie Names
【24h】

Keyword Based Searching According to the Movie Names

机译:基于电影名称的基于关键字的搜索

获取原文
           

摘要

Keyword based queries are inherently ambiguous such that given a set of keywords the database search engine has only an uncertain guess about the user’s informational need represented by the query. Possibly high complexity of the data makes providing intelligent search results effectively extremely challenging. Databases enable users to precisely express their informational needs using structured queries. However, database query construction is a laborious and error-prone process, which cannot be performed well by most end users. Keyword search alleviates the usability problem at the price of query expressiveness. As keyword search algorithms do not differentiate between the possible informational needs represented by a keyword query, users may not receive adequate results. This paper presents Extended Incremental Query Processing - a novel approach to bridge the gap between usability of keyword search and expressiveness of database queries. Extended Incremental Query Processing enables a user to start with an arbitrary keyword query and incrementally refine it into a structured query through an interactive interface. The enabling techniques of Extended Incremental Query Processing include: 1) A probabilistic framework for incremental query construction; 2) A probabilistic model to assess the possible informational needs represented by a keyword query; 3) An algorithm to obtain the optimal query construction process. This paper presents the detailed design of Extended Incremental Query Processing, and demonstrates its effectiveness and scalability through experiments over real-world data and a user study. Extracting information from semi structured documents is a very hard task. Documents are often so large that the data set returned as answer to a query may be too big to convey interpretable knowledge. In this, we describe an approach based on Tree-Based Association Rules (TARs): mined rules, which provide approximate, intentional information on both the structure and contents of XML documents. This mined knowledge is later used to provide: a concise idea—the gist—of both the structure and the content of the XML document .quick, approximate answers to queries.
机译:基于关键字的查询本质上是模棱两可的,因此,给定一组关键字,数据库搜索引擎只能对查询所代表的用户的信息需求做出不确定的猜测。数据的高度复杂性使得有效地提供智能搜索结果极具挑战性。数据库使用户能够使用结构化查询精确表达其信息需求。但是,数据库查询构造是一个费力且容易出错的过程,大多数最终用户无法很好地执行。关键字搜索以查询表现力为代价缓解了可用性问题。由于关键字搜索算法无法区分关键字查询所代表的可能的信息需求,因此用户可能无法获得足够的结果。本文提出了扩展增量查询处理-一种新的方法来弥合关键字搜索的可用性与数据库查询的表达能力之间的差距。扩展的增量查询处理使用户可以从任意关键字查询开始,然后通过交互式界面将其逐步细化为结构化查询。扩展的增量查询处理的实现技术包括:1)一个用于增量查询构建的概率框架; 2)概率模型,用于评估关键字查询所代表的可能的信息需求; 3)一种用于获取最佳查询构造过程的算法。本文介绍了扩展增量查询处理的详细设计,并通过对真实数据的实验和用户研究证明了其有效性和可扩展性。从半结构化文档中提取信息是一项非常艰巨的任务。文档通常太大,以至于作为查询答案返回的数据集可能太大,无法传达可解释的知识。在本文中,我们描述了一种基于树的关联规则(TAR)的方法:挖掘规则,它提供有关XML文档的结构和内容的近似,有意的信息。以后将使用这些知识来提供:XML文档的结构和内容的简要思想(要点)。快速,近似的查询答案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号