首页> 外文期刊>Advanced engineering informatics >A personalized query expansion approach for engineering document retrieval
【24h】

A personalized query expansion approach for engineering document retrieval

机译:用于工程文档检索的个性化查询扩展方法

获取原文
获取原文并翻译 | 示例

摘要

Engineers create engineering documents with their own terminologies, and want to search existing engineering documents quickly and accurately during a product development process. Keyword-based search methods have been widely used due to their ease of use, but their search accuracy has been often problematic because of the semantic ambiguity of terminologies in engineering documents and queries. The semantic ambiguity can be alleviated by using a domain ontology. Also, if queries are expanded to incorporate the engineer's personalized information needs, the accuracy of the search result would be improved. Therefore, we propose a framework to search engineering documents with less semantic ambiguity and more focus on each engineer's personalized information needs. The framework includes four processes: (1) developing a domain ontology, (2) indexing engineering documents, (3) learning user profiles, and (4) performing personalized query expansion and retrieval. A domain ontology is developed based on product structure information and engineering documents. Using the domain ontology, terminologies in documents are disambiguated and indexed. Also, a user profile is generated from the domain ontology. By user profile learning, user's interests are captured from the relevant documents. During a personalized query expansion process, the learned user profile is used to reflect user's interests. Simultaneously, user's searching intent, which is implicitly inferred from the user's task context, is also considered. To retrieve relevant documents, an expanded query in which both user's interests and intents are reflected is then matched against the document collection. The experimental results show that the proposed approach can substantially outperform both the keyword-based approach and the existing query expansion method in retrieving engineering documents. Reflecting a user's information needs precisely has been identified to be the most important factor underlying this notable improvement.
机译:工程师使用自己的术语创建工程文档,并希望在产品开发过程中快速而准确地搜索现有工程文档。基于关键字的搜索方法由于易于使用而被广泛使用,但是由于工程文档和查询中术语的语义含糊性,它们的搜索准确性经常成问题。可以通过使用领域本体来减轻语义上的歧义。同样,如果扩展查询以合并工程师的个性化信息需求,则搜索结果的准确性将得到提高。因此,我们提出了一个搜索框架,以减少语义歧义,并更多地关注每个工程师的个性化信息需求。该框架包括四个过程:(1)开发域本体;(2)为工程文档建立索引;(3)学习用户配置文件;以及(4)执行个性化查询扩展和检索。基于产品结构信息和工程文档开发领域本体。使用领域本体,文档中的术语会被消除歧义并建立索引。而且,从域本体生成用户配置文件。通过用户资料学习,可以从相关文档中捕获用户的兴趣。在个性化查询扩展过程中,学习到的用户配置文件用于反映用户的兴趣。同时,还考虑了从用户任务上下文中隐式推断出的用户搜索意图。为了检索相关文档,然后将反映用户兴趣和意图的扩展查询与文档集合进行匹配。实验结果表明,该方法在检索工程文档中可以大大优于基于关键字的方法和现有的查询扩展方法。准确地反映出用户的信息需求已被确定为这一显着改善的最重要因素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号