首页>
外国专利>
Determining document subject by using title and anchor text of related documents
Determining document subject by using title and anchor text of related documents
展开▼
机译:使用相关文档的标题和锚文本确定文档主题
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and method identifies a subject for a source document. The system and method identifies a collection of peer documents from the same domain as the source document. For each of the peer documents, a collection of linking documents containing a hyperlink to the peer document is identified. For each of the peer documents, a label is generated by choosing the longest-match anchor text of the linking documents. A pattern between the labels and the titles of the collection of peer documents is deduced. The subject of the source document is identified by applying the pattern to the title of the source document.
展开▼