首页> 外文会议>International conference on web engineering >Semantic Fingerprinting: A Novel Method for Entity-Level Content Classification
【24h】

Semantic Fingerprinting: A Novel Method for Entity-Level Content Classification

机译:语义指纹:实体级内容分类的一种新方法

获取原文

摘要

With the constantly growing Web, there is a need for automatically analyzing, interpreting and organizing contents. A particular need is given by the management of Web contents with respect to classification systems, e.g. based on ontologies in the LOD (Linked Open Data) cloud. Research in deep learning recently has shown great progress in classifying data based on large volumes of training data. However, "targeted" and fine-grained information systems require classification methods based on a relatively small number of "representative" samples. For that purpose, we present an approach that allows a semantic exploitation of Web contents and - at the same time - computationally efficient processing based on "Semantic Fingerprinting". To this end, we raise Web contents to the entity-level and exploit entity-related information that allows "distillation" and fine-grained classification of the Web content by its "semantic fingerprint". In experimental results on Web contents classified in Wikipedia, we show the superiority of our approach against state-of-the-art methods.
机译:随着Web的不断发展,需要自动分析,解释和组织内容。对于分类系统,例如Web系统,对Web内容的管理提出了特殊的需求。基于LOD(链接的开放数据)云中的本体。深度学习的研究最近显示了在基于大量训练数据进行数据分类方面的巨大进步。但是,“目标”和细粒度的信息系统需要基于相对少量的“代表性”样本的分类方法。为此,我们提出了一种方法,该方法允许对Web内容进行语义利用,并同时允许基于“语义指纹”的高效计算处理。为此,我们将Web内容提升到实体级别,并利用与实体相关的信息,这些信息允许通过其“语义指纹”对Web内容进行“提取”和细粒度分类。在分类为Wikipedia的Web内容的实验结果中,我们展示了我们的方法相对于最新方法的优越性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号