首页> 外文会议>International conference on wireless networks >An Open-Source Based Speech Recognition Android Application for Helping Handicapped Students Writing Programs
【24h】

An Open-Source Based Speech Recognition Android Application for Helping Handicapped Students Writing Programs

机译:一个基于开源的语音识别Android应用程序,可帮助残障学生编写程序

获取原文
获取外文期刊封面目录资料

摘要

We describe in this paper how to use open-source speech recognition technologies to design and implement an Android application that helps students with physical disabilities write programs in classrooms. Google Voice Recognition (GVR)[ ], which is a free and open Android tool, is utilized to convert the speech of a user to text. To fully utilize GVR, the Android phone has to be connected to the Internet. In a typical setup, a handicapped student sits in front of a workstation with a large computer screen like the one shown in Figure 6. The student speaks to an Android phone, which converts speech to text using GVR. The text is then sent to the workstation through Wi-Fi for parsing and analysis. The processed text, which is a code segment of a program, is displayed on the workstation screen. Since the Android main activity thread cannot handle too many activities, we save the text of the speech in a buffer and use another thread, named communication thread, to send it to the workstation using the standard socket API. The producer-consumer paradigm is employed to synchronize the generation of text data by the main activity thread and the sending of the data by the communication thread[ , , ]. The server program that runs on the workstation is written in C/C++. The main thread listens at a port. When it detects data, it creates two threads to handle the data. One thread created reads in the data, parses them into words, and puts the words in a circular queue. The other thread, named processing thread, simultaneously retrieves words from the queue, processes them to generate a code segment, saves the code in a file and displays it on the screen. A condition variable[ , ] is used to synchronize the tasks between these two threads. The keywords and symbols of the programming language that the student is using, which are saved in a file are loaded into a table. The processing thread uses a hashing and mapping scheme to obtain the proper keywords and symbols from the table; as humans often speak with inconsistency, several different words may map to the same keyword. For example, when one tries to say the word import, they may say it slightly different from the standard pronunciation and the recognizer generates the word important. The scheme will map important to the same location as import to retrieve the correct keyword.
机译:我们在本文中描述了如何使用开源语音识别技术来设计和实现一个Android应用程序,该应用程序可以帮助肢体残疾学生在教室里编写程序。 Google语音识别(GVR)[]是一种免费且开放的Android工具,用于将用户的语音转换为文本。要充分利用GVR,必须将Android手机连接到Internet。在典型的设置中,有残障的学生坐在工作站前面,该工作站具有大的计算机屏幕,如图6所示。该学生使用Android手机讲话,后者使用GVR将语音转换为文本。然后将文本通过Wi-Fi发送到工作站进行解析和分析。处理后的文本是程序的代码段,显示在工作站屏幕上。由于Android主要活动线程无法处理太多活动,因此我们将语音文本保存在缓冲区中,并使用另一个名为通信线程的线程使用标准套接字API将其发送到工作站。生产者-消费者范例用于通过主活动线程同步文本数据的生成和通过通信线程[,,]发送数据。在工作站上运行的服务器程序是用C / C ++编写的。主线程侦听端口。当检测到数据时,它将创建两个线程来处理数据。创建的一个线程读取数据,将其解析为单词,然后将单词放入循环队列中。另一个名为处理线程的线程同时从队列中检索单词,对其进行处理以生成代码段,然后将代码保存在文件中并将其显示在屏幕上。条件变量[,]用于在这两个线程之间同步任务。将学生使用的编程语言的关键字和符号(保存在文件中)加载到表格中。处理线程使用哈希和映射方案从表中获取适当的关键字和符号。由于人类经常会说不一致的话,所以几个不同的词可能会映射到同一关键字。例如,当尝试说出import单词时,他们可能会说它与标准发音略有不同,并且识别器会生成重要单词。该方案会将重要映射到与导入相同的位置,以检索正确的关键字。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号