2010 IEEE Fourth International Conference on Semantic Computing

Semantic TagPrint - Tagging and Indexing Content for Semantic Search and Content Management


Abstract

Existing search and content management technology faces the challenge of locating desired content as the volume of documents grows exponentially. One approach to mitigating this issue is to make use of user-generated tags. However, the improvements are limited because tags are (1) free from context and form, (2) user generated, (3) used for purposes other than description, and (4) often ambiguous. Because tagging is voluntary, some documents are not tagged at all, and interpreting the tags on documents that are tagged remains a challenge. To overcome these challenges, semantic web resources and technologies can be used to generate semantic tags automatically. Semantic tags not only reflect document content more accurately but also enable better search results. Ontology coverage, ontology mapping, and the weighting of significant ontological entities within a context are key challenges in semantic tagging systems. To address them, this paper presents a semantic tagging system, Semantic TagPrint, that maps a text document to semantic tags defined as entities in an ontology. Semantic TagPrint uses a linear-time lexical chaining Word Sense Disambiguation (WSD) algorithm for real-time concept mapping. In addition, it uses statistical metrics and ontological features of the ontology to weight and recommend the semantic tags. A comparative evaluation shows that our mapping algorithm is fairly accurate and that our tag recommendation algorithm performs better than other systems and algorithms.
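
The abstract gives no algorithmic detail, so the following is only a toy sketch of the general idea: map ambiguous words to ontology concepts using related concepts found elsewhere in the document, then weight the chosen concepts with a statistical signal and an ontological one. Everything here is an assumption made for illustration and is not Semantic TagPrint's actual method: the `ONTOLOGY` table, the `related` test (parent/child or shared parent), the weight `term frequency * (1 + depth)`, and all function names are hypothetical. The sketch is also quadratic in the number of matched terms, unlike the linear-time algorithm the paper describes.

```python
from collections import Counter

# Hypothetical toy ontology: concept -> (parent concept, surface labels).
ONTOLOGY = {
    "Animal": (None, ["animal"]),
    "Jaguar_Cat": ("Animal", ["jaguar"]),
    "Lion": ("Animal", ["lion"]),
    "Vehicle": (None, ["vehicle"]),
    "Jaguar_Car": ("Vehicle", ["jaguar"]),
}

def parent(concept):
    return ONTOLOGY[concept][0]

def depth(concept):
    """Number of ancestors above a concept (assumed ontological feature)."""
    d = 0
    while parent(concept) is not None:
        concept = parent(concept)
        d += 1
    return d

def related(a, b):
    """Assumed relatedness test: parent/child, or siblings under one parent."""
    pa, pb = parent(a), parent(b)
    return a == pb or b == pa or (pa is not None and pa == pb)

def candidates(token):
    """All concepts whose labels contain the token."""
    return [c for c, (_, labels) in ONTOLOGY.items() if token in labels]

def tag_document(text, top_k=3):
    counts = Counter(text.lower().split())
    senses = {t: candidates(t) for t in counts if candidates(t)}
    weights = Counter()
    for token, options in senses.items():
        def context_support(sense):
            # Sum the frequency of other terms that have a related candidate.
            return sum(
                counts[other]
                for other, other_options in senses.items()
                if other != token
                and any(related(sense, o) for o in other_options)
            )
        # Pick the sense best supported by the rest of the document.
        best = max(options, key=context_support)
        # Weight = term frequency scaled by depth in the toy ontology.
        weights[best] += counts[token] * (1 + depth(best))
    return weights.most_common(top_k)

print(tag_document("the jaguar chased a lion while another animal watched the jaguar"))
print(tag_document("the jaguar is a fast vehicle"))
```

In the first document "jaguar" resolves to the animal sense because "lion" and "animal" support it; in the second, "vehicle" pulls it toward the car sense. A real system would need a far larger ontology and a principled weighting scheme.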
