首页>
外国专利>
Producing datasets for representing terms and objects based on automated learning from text contents
Producing datasets for representing terms and objects based on automated learning from text contents
展开▼
机译:根据对文本内容的自动学习,生成用于表示术语和对象的数据集
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and methods for creating data objects as symbolic or associative representations of terms or objects using machine-based methods are presented. A term can be a word or a phrase, which can also be the name of an object. For a given term, the methods analyze other terms associated with the term, and determine a set of terms or values to be attached to the term to form a dataset, either as a representation of the term, or as information about an object represented by the term, including various properties associated with the object. The methods include obtaining a group of text contents or non-natural language data contents, specifying a target term or symbol, and identifying contextual attributes of the target term or symbol. The contextual attributes include positional and distance attributes, as well as grammatical and semantic attributes.
展开▼