【24h】

I See What You Say (ISWYS): Arabic lip reading system

机译:我明白你在说什么(ISWYS):阿拉伯语唇读系统

获取原文
获取原文并翻译 | 示例

摘要

The ability of communicating easily with everyone is a blessing people with hearing impairment do not have. They completely rely on their vision around healthy individuals to difficultly read lips. This paper proposes a solution for this problem, ISWYS (I See What You Say) is a research-oriented speech recognition system for the Arabic language that interprets lips movements into readable text. It is accomplished by analyzing a video of lips movements that resemble utterances, and then converting it to readable characters using video analysis and motion estimation. Our algorithm involves dividing the video into n number of frames to generate n-1 image frame which is produced by taking the difference between consecutive frames. Then, video features are extracted to be used by our error function which provided recognition of approximately 70%.
机译:与所有人轻松交流的能力是听力障碍人士所没有的祝福。他们完全依靠对健康个体的视力来难以阅读嘴唇。本文提出了解决此问题的方法,ISWYS(我明白您说的话)是一种面向研究的阿拉伯语语音识别系统,可以将嘴唇的运动解释为可读的文本。它是通过分析类似于发声的嘴唇运动视频,然后使用视频分析和运动估计将其转换为可读字符来实现的。我们的算法包括将视频分为n个帧,以生成n-1个图像帧,该图像帧是通过获取连续帧之间的差异而产生的。然后,提取视频特征以供我们的误差函数使用,该误差函数可提供约70%的识别率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号