METHOD AND APPARATUS FOR DETECTING DUPLICATE POSTS IN BLOG ENVIRONMENT
Abstract
PURPOSE: A method and an apparatus for detecting duplicate posts in a blog environment are provided. They improve the quality of a search service by keeping duplicate posts out of blog search results, and they detect duplicate posts efficiently by using a duplication-rate measuring function. CONSTITUTION: A duplicate decision index (311) stores the index keys generated from a new document. A search index (313) stores the document if it does not duplicate an existing document. A blog database (315) supplies the document selected under the control of a query processing unit. A feature sequence extractor extracts a feature sequence from the document. An index key generator generates combinable index keys from the feature sequence. A duplicate decision unit decides whether the document is a duplicate by using the subsequence.
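
The abstract names the components but does not say how features are chosen, how index keys are formed, or how the duplication rate is computed. The following Python sketch is one possible reading, not the patented method. It assumes the feature sequence is the few longest distinct words of a post, that index keys are fixed-size subsequences of that sequence, and that the duplication rate is the Jaccard overlap of two key sets. Every function name, parameter, and threshold in it is hypothetical.

```python
from itertools import combinations


def extract_feature_sequence(text, n_features=4):
    """Assumed extractor: the n longest distinct words, in order of first appearance."""
    distinct = list(dict.fromkeys(text.lower().split()))
    top = set(sorted(distinct, key=lambda w: (-len(w), w))[:n_features])
    return [w for w in distinct if w in top]


def generate_index_keys(features, key_size=3):
    """Assumed key scheme: every order-preserving subsequence of key_size features."""
    return {" ".join(combo) for combo in combinations(features, key_size)}


def duplicate_rate(keys_a, keys_b):
    """Assumed measure: Jaccard overlap of two key sets (0.0 when both are empty)."""
    union = keys_a | keys_b
    return len(keys_a & keys_b) / len(union) if union else 0.0


class DuplicateDetector:
    def __init__(self, threshold=0.5):
        self.duplicate_decision_index = {}  # index key -> ids of documents holding it
        self.keys_by_doc = {}               # document id -> its set of index keys
        self.search_index = {}              # document id -> text (non-duplicates only)
        self.threshold = threshold          # hypothetical cut-off for "duplicate"

    def add(self, doc_id, text):
        """Index the document unless it duplicates one already indexed.

        Returns True if the document was stored, False if it was rejected.
        """
        keys = generate_index_keys(extract_feature_sequence(text))

        # Only documents that share at least one index key are compared.
        candidates = set()
        for key in keys:
            candidates |= self.duplicate_decision_index.get(key, set())

        for other in candidates:
            if duplicate_rate(keys, self.keys_by_doc[other]) >= self.threshold:
                return False

        for key in keys:
            self.duplicate_decision_index.setdefault(key, set()).add(doc_id)
        self.keys_by_doc[doc_id] = keys
        self.search_index[doc_id] = text
        return True
```

Looking up candidates through shared keys means a new post is compared only with posts that already overlap it, which is one plausible reason for keeping a separate duplicate decision index next to the search index.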