SELECTION OF TEXT CLASSIFIER PARAMETER BASED ON SEMANTIC CHARACTERISTICS

Abstract

FIELD: physics.
SUBSTANCE: to evaluate text classifier parameters based on semantic characteristics, a processing device performs semantic-syntactic analysis of a natural-language text from a corpus of natural-language texts, producing a semantic structure that represents a set of semantic classes. A feature of the natural-language text is identified, to be extracted using a set of values of a set of feature-extraction parameters. The corpus is split into a training sample, containing a first set of natural-language texts, and a test sample, containing a second set of natural-language texts. A set of feature-extraction parameter values is determined, taking into account the categories of the training sample. The resulting set of parameter values is then evaluated on the test sample.
EFFECT: improved accuracy of classification results.
20 claims, 15 drawings.
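The procedure in the abstract (split the corpus, determine feature-extraction parameter values from the labelled training sample, evaluate them on the test sample) can be illustrated with a minimal sketch. It is not the patented method: plain word n-grams stand in for the semantic-syntactic analysis, and the parameters `ngram` and `min_len`, the overlap-based classifier, the leave-one-out selection rule, and the toy corpus are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product


def extract_features(text, ngram, min_len):
    """Bag of word n-grams. `ngram` and `min_len` are hypothetical
    feature-extraction parameters (stand-ins for semantic features)."""
    tokens = [t for t in text.lower().split() if len(t) >= min_len]
    return Counter(
        " ".join(tokens[i:i + ngram]) for i in range(len(tokens) - ngram + 1)
    )


def train(samples, params):
    """Accumulate one feature-count profile per category."""
    model = {}
    for text, category in samples:
        model.setdefault(category, Counter()).update(
            extract_features(text, **params)
        )
    return model


def classify(model, text, params):
    """Return the category whose profile overlaps most with the text."""
    feats = extract_features(text, **params)

    def score(category):
        profile = model[category]
        total = sum(profile.values()) or 1  # avoid division by zero
        return sum(n * profile[f] for f, n in feats.items()) / total

    # sorted() makes tie-breaking deterministic
    return max(sorted(model), key=score)


def accuracy(model, samples, params):
    hits = sum(classify(model, text, params) == cat for text, cat in samples)
    return hits / len(samples)


def select_parameters(training, grid):
    """Pick the parameter values with the best leave-one-out accuracy
    on the training sample, using its category labels."""
    best, best_acc = None, -1.0
    for values in product(*grid.values()):
        params = dict(zip(grid, values))
        hits = 0
        for i, (text, cat) in enumerate(training):
            rest = training[:i] + training[i + 1:]
            hits += classify(train(rest, params), text, params) == cat
        acc = hits / len(training)
        if acc > best_acc:
            best, best_acc = params, acc
    return best


# Toy corpus, split into a training sample and a test sample.
training_sample = [
    ("the bank raised interest rates", "finance"),
    ("the bank cut loan rates", "finance"),
    ("the team won the match", "sport"),
    ("the team lost the final match", "sport"),
]
test_sample = [
    ("interest rates on the loan", "finance"),
    ("the final match of the team", "sport"),
]

grid = {"ngram": [1, 2], "min_len": [1, 4]}
best_params = select_parameters(training_sample, grid)

# Evaluate the selected parameter values on the held-out test sample.
final_model = train(training_sample, best_params)
test_accuracy = accuracy(final_model, test_sample, best_params)
print(best_params, test_accuracy)
```

The test sample plays no part in choosing the parameter values, so the final accuracy is an estimate on texts the selection step never saw. This mirrors the separation of training and test samples that the abstract describes.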