首页> 外文会议>IEA/AIE 2010;International conference on industrial engineering and other applications of applied intelligent systems >Building Digital Ink Recognizers Using Data Mining: Distinguishing between Text and Shapes in Hand Drawn Diagrams
【24h】

Building Digital Ink Recognizers Using Data Mining: Distinguishing between Text and Shapes in Hand Drawn Diagrams

机译:使用数据挖掘构建数字墨水识别器:区分手绘图中的文本和形状

获取原文

摘要

The low accuracy rates of text-shape dividers for digital ink diagrams are hindering their use in real world applications. While recognition of handwriting is well advanced and there have been many recognition approaches proposed for hand drawn sketches, there has been less attention on the division of text and drawing. The choice of features and algorithms is critical to the success of the recognition, yet heuristics currently form the basis of selection. We propose the use of data mining techniques to automate the process of building text-shape recognizers. This systematic approach identifies the algorithms best suited to the specific problem and generates the trained recognizer. We have generated dividers using data mining and training with diagrams from three domains. The evaluation of our new recognizer on realistic diagrams from two different domains, against two other recognizers shows it to be more successful at dividing shapes and text with 95.2% of strokes correctly classified compared with 86.9% and 83.3% for the two others.
机译:用于数字墨水图的文本形状分隔器的低准确率正在阻碍它们在实际应用中的使用。尽管手写识别已得到很好的发展,并且已经提出了许多用于手绘草图的识别方法,但对文本和图形的划分的关注却很少。特征和算法的选择对于识别的成功至关重要,但是启发式方法目前是选择的基础。我们建议使用数据挖掘技术来自动化构建文本形状识别器的过程。这种系统的方法可以确定最适合特定问题的算法,并生成训练有素的识别器。我们使用数据挖掘和训练来生成来自三个域的图的分隔线。我们的新识别器在来自两个不同领域的现实图中相对于其他两个识别器的评估表明,在正确区分笔划的95.2%的笔画和文字上,这是比较成功的,而其他两个笔录则分别为86.9%和83.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号