首页>
外国专利>
AUTOMATICALLY LABELING DATA USING CONCEPTUAL DESCRIPTIONS
AUTOMATICALLY LABELING DATA USING CONCEPTUAL DESCRIPTIONS
展开▼
机译:使用概念描述自动标记数据
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.
展开▼