首页> 外文期刊>IEEE Transactions on Pattern Analysis and Machine Intelligence >Handwritten character classification using nearest neighbor in large databases
【24h】

Handwritten character classification using nearest neighbor in large databases

机译:大型数据库中使用最近邻居的手写字符分类

获取原文
获取原文并翻译 | 示例
           

摘要

Shows that systems built on a simple statistical technique and a large training database can be automatically optimized to produce classification accuracies of 99% in the domain of handwritten digits. It is also shown that the performance of these systems scale consistently with the size of the training database, where the error rate is cut by more than half for every tenfold increase in the size of the training set from 10 to 100,000 examples. Three distance metrics for the standard nearest neighbor classification system are investigated: a simple Hamming distance metric, a pixel distance metric, and a metric based on the extraction of penstroke features. Systems employing these metrics were trained and tested on a standard, publicly available, database of nearly 225,000 digits provided by the National Institute of Standards and Technology. Additionally, a confidence metric is both introduced by the authors and also discovered and optimized by the system. The new confidence measure proves to be superior to the commonly used nearest neighbor distance.
机译:表明基于简单统计技术和大型培训数据库的系统可以自动进行优化,以产生手写数字域中99%的分类精度。还表明,这些系统的性能与训练数据库的大小一致地扩展,其中,将训练集的大小从10个增加到100,000个示例,每增加十倍,错误率就会减少一半以上。研究了标准最近邻分类系统的三个距离度量:简单的汉明距离度量,像素距离度量和基于笔划特征提取的度量。使用这些指标的系统在美国国家标准技术研究院提供的标准的,可公开获取的,近225,000位的数据库中进行了培训和测试。此外,作者引入了置信度度量,并且系统还发现并优化了置信度度量。事实证明,新的置信度度量优于常用的最近邻居距离。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号