首页> 外文会议>IEEE Systems and Information Engineering Design Symposium >Data retrieval for client projects: Matching data onto an ontology map to produce a relevance assessment
【24h】

Data retrieval for client projects: Matching data onto an ontology map to produce a relevance assessment

机译:数据检索客户端项目:将数据匹配到本体图映射以产生相关性评估

获取原文

摘要

Discerning relevant data is becoming more difficult, time consuming, and costly as the amount of data available dramatically increases. Currently, consulting firms strive to use data to support client's business decisions with evidence. To be effective at this, consultants must consider the applicability of both internal and external data libraries to their clients' requirements. Frequently, evaluating the applicability of data sets is a manual process, which can be costly to the firm and the client. This paper describes a technical approach to automate this process. Specifically, it details the structure of a software application, named UVa Open Miner, capable of assessing the applicability of data sources to client projects. This UVa Open Miner aims to maximize the scale and diversity of candidate data sets, increase the relevance of data found, and maintain manageable computational complexity. UVa Open Miner consists of two segments: mapping and matching. The mapping component text mines web pages to identify an ontology of keywords describing the business requirement. This enables users to handle diverse business requirements from various industry verticals. The matching component scores data sets based on a relevance factor obtained from the ontology map. To validate the application, subject matter experts provided business requirements for a problem in their domain, and validated the application's results. Professionals in environment science, political science and policy-making fields found the application to be useful. Therefore, the application, along with the framework used, can be refactored into a reusable solution for consulting firms to use for their clients.
机译:辨别相关数据正变得越来越困难,耗时,并且昂贵,因为可用的数据量显着增加。目前,咨询公司努力使用数据来支持客户的业务决策与证据。为了有效,顾问必须考虑内部和外部数据库对客户要求的适用性。频繁地,评估数据集的适用性是手动过程,这可能对公司和客户端很昂贵。本文介绍了一种自动化此过程的技术方法。具体而言,它详细介绍了名为UVA开放矿工的软件应用程序的结构,能够评估数据源对客户端项目的适用性。这个UVA开放矿工旨在最大限度地提高候选数据集的规模和多样性,增加找到的数据的相关性,并保持可管理的计算复杂性。 UVA开放矿工由两个部分组成:映射和匹配。映射组件文本挖掘网页以识别描述业务需求的关键字的本体。这使用户能够处理各种行业垂直的不同业务需求。匹配组件基于从本体映射获得的相关性因子进行分数数据集。为了验证申请,主题专家专家为其域中提供了业务要求,并验证了申请的结果。环境科学,政治科学和政策制定领域的专业人士发现申请是有用的。因此,应用程序以及所使用的框架可以重构成可重复使用的解决方案,以便咨询公司为客户使用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号