首页>
外国专利>
Contents attribute information normalization manner, from the data gathering service offer system, and the access document which
Contents attribute information normalization manner, from the data gathering service offer system, and the access document which
展开▼
机译:内容属性信息的规范化方式,来自数据收集服务提供系统,以及访问文件
展开▼
页面导航
摘要
著录项
相似文献
摘要
PROBLEM TO BE SOLVED: To construct a data base for providing a service without being restricted by the structure/form of a document for the perusal of an information provider by performing normalization processing of an attribute structure as of contents attribute information and performing normalization of a character expression form and normalization processing of numerical expression. ;SOLUTION: An automatic information collecting part 101 of an automatic information collecting and classifying device 100 patrols a Web site 120 of an information provider on a network 110, collects document files and extracts information. An attribute extraction part 102 normalizes a character code and then extracts only the contents attribute information as of the collected and extracted documents. An attribute normalization part 103 refers to a normalization rule 106 and normalizes the contents attribute information provided with the structure/form coincidently with a perusal document extracted by the attribute extraction part 102 to the form suited to a retrieval service or the like. Further, as of the contents attribute information whose the structure is normalized, a character expression form is normalized and numerical expression is normalized.;COPYRIGHT: (C)1999,JPO
展开▼