首页> 外国专利> METHOD AND APPARATUS FOR DETECTING DUPLICATE POSTS IN BLOG ENVIRONMENT

METHOD AND APPARATUS FOR DETECTING DUPLICATE POSTS IN BLOG ENVIRONMENT

机译:在博客环境中检测重复帖子的方法和装置

摘要

PURPOSE: A detecting method of a duplication post in a blog environment and an apparatus thereof for improving the quality of a search service by preventing that the duplication post is reflected in the blog search result are provided to efficiently perform the detection of duplication post by using a duplicate rate measuring function. CONSTITUTION: A duplicate decision index(311) stores an index keys through a new document. If it is not a duplicate document overlapped with, a search index(313) stores document. A blog database(315) offers the document selected according to the control of a query processing unit. A feature sequence extractor extracts the feature sequence. An index key generator generates combinable indexed keys through the feature sequence. A duplicate decision unit decides the duplication of document through the subsequence.
机译:目的:提供一种在博客环境中检测重复帖子的方法及其装置,其通过防止重复帖子反映在博客搜索结果中来提高搜索服务的质量,以通过使用高效地执行重复帖子的检测。重复率测量功能。构成:重复决策索引(311)通过新文档存储索引键。如果不是重叠的重复文档,则搜索索引(313)存储文档。博客数据库(315)提供根据查询处理单元的控制选择的文档。特征序列提取器提取特征序列。索引键生成器通过特征序列生成可组合的索引键。复制决定单元通过子序列决定文件的复制。

著录项

  • 公开/公告号KR100986404B1

    专利类型

  • 公开/公告日2010-10-08

    原文格式PDF

  • 申请/专利权人

    申请/专利号KR20080063126

  • 发明设计人 김상욱;박희진;이순행;이상철;

    申请日2008-06-30

  • 分类号G06Q50;

  • 国家 KR

  • 入库时间 2022-08-21 18:30:44

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号