首页> 外文会议>Proceedings of the Second International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) >Chinese Document Information Processing Model Based on Random Walk Algorithm
【24h】

Chinese Document Information Processing Model Based on Random Walk Algorithm

机译:基于随机游走算法的中文文档信息处理模型

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we conduct research on Chinese document information processing model based on random walk algorithm. Because of the complexity and also the particularity of processing Chinese information, Chinese search engine technology needs to be improved. The Chinese search engine cannot directly copy foreign technology. To study and analyze the expertise of the Chinese, we can accurately find the need in vast information base as the Chinese information. In this paper, the dictionary learning and sparse representation with random walk model are introduced into the character recognition to solve the problem of pen character and noise of the fax characters. The novel analytic framework is presented to assist the processing of the methodologies. The recognition method does not require preprocessing operations such as character binarization and thinning, only one feature and one classifier is needed, compared with the current multi-feature multi-cascade classifier fusion recognition method, proposed recognition method has characteristics of low complexity. The test on the experiment also reflects the robustness of the proposed model.
机译:本文对基于随机游走算法的中文文档信息处理模型进行了研究。由于处理中文信息的复杂性和特殊性,需要改进中文搜索引擎技术。中文搜索引擎无法直接复制外国技术。通过研究和分析中文的专业知识,我们可以准确地找到庞大的信息基础作为中文信息的需求。本文将字典学习和具有随机游走模型的稀疏表示引入字符识别中,以解决笔形字符和传真字符的杂音问题。提出了新颖的分析框架以协助方法的处理。该识别方法不需要字符二值化和细化等预处理操作,只需要一个特征和一个分类器,与目前的多特征多级分类器融合识别方法相比,该识别方法具有低复杂度的特点。实验测试也反映了所提出模型的鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号