首页> 外文会议>Advances in Science and Engineering Technology International Conferences >Scene to Text Conversion and Pronunciation for Visually Impaired People
【24h】

Scene to Text Conversion and Pronunciation for Visually Impaired People

机译:视障人士的场景到文本转换和发音

获取原文

摘要

The recent technological advancements are focusing on developing smart systems to improve the quality of life. Machine learning algorithms and artificial intelligence are becoming elementary tools, which are used in the establishment of modern smart systems across the globe. In this context, an effective approach is suggested for automated text detection and recognition for the natural scenes. The incoming image is firstly enhanced by employing Contrast Limited Adaptive Histogram Equalization (CLAHE). Afterward, the text regions of the enhanced image are detected by employing the Maximally Stable External Regions (MSER) feature detector. The non-text MSERs are removed by employing appropriate filters. The remaining MSERs are grouped into words. The text recognition is performed by employing an Optical Character Recognition (OCR) function. The extracted text is pronounced by using a suitable speech synthesizer. The proposed system prototype is realized. The system functionality is verified with the help of an experimental setup. Results prove the concept and working principle of the devised system. It shows the potential of employing the suggested method for the development of modern devices for visually impaired people.
机译:最近的技术进步集中在开发智能系统以改善生活质量上。机器学习算法和人工智能正在成为基本工具,已在全球范围内用于建立现代智能系统。在这种情况下,建议一种有效的方法来自动检测和识别自然场景。首先通过使用对比度受限的自适应直方图均衡化(CLAHE)增强输入图像。然后,通过使用最大稳定外部区域(MSER)特征检测器来检测增强图像的文本区域。通过使用适当的过滤器可以删除非文本MSER。其余的MSER分为单词。通过使用光学字符识别(OCR)功能来执行文本识别。通过使用合适的语音合成器来发音提取的文本。所提出的系统原型得以实现。通过实验设置验证了系统功能。结果证明了所设计系统的概念和工作原理。它显示了采用建议的方法开发视力障碍者现代设备的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号