Conference: International Russian Automation Conference

Development of Multimethod Approach to Rubrication of Unstructured Electronic Text Documents in Various Conditions

Abstract

Tools for information and communication exchange between municipal and federal authorities and citizens and organizations are currently under active development. The growing volume of electronic information makes it necessary to classify large numbers of incoming messages. However, the specific features of such documents (small size, lack of structure, grammatical and syntactic errors, thesaurus non-stationarity, etc.) complicate their statistical analysis. In addition, the conditions under which they are processed differ significantly, which rules out a single universal method of text document classification. This makes it an urgent task to develop a multimethod approach to the rubrication of unstructured electronic documents based on probabilistic and intelligent methods of text data analysis. Computational experiments carried out with both interrelated and non-interrelated rubrics indicate that these methods are promising for practical application.