
Impact of different speech interfaces of personal devices on users' perception

Abstract

Because Text-to-Speech (TTS) lacks both the clarity and the prosody of normal human speech, it sounds unnatural and is unpleasant to listen to. It is generally accepted that natural speech should be used for static prompts and synthetic speech for dynamic content. However, most commercial applications on the market mix human speech and TTS within the same sentence and/or between sentences, and this mixing leads to an inconsistent interface (Gong & Lai, 2001). An immediate issue in the design of such speech interfaces is therefore what type of speech should be used.

The goal of this project is to explore users’ perception of different types of speech in order to investigate the acceptability of personal speech interfaces. The study is aimed at public users of mobile applications. The project explored the redevelopment of the speech interface of the Goal Management Training (GMT) system, based on the results of testing different speech samples with the VoiceTester mobile application delivered by the project. VoiceTester was developed for the iPhone in this study to facilitate the listening task; by simulating the environment of speech interfaces on personal devices, it adds validity to participants’ responses. The contribution of this study is to give developers and health researchers some knowledge about the impact of different types of speech interfaces on users’ perception. The findings are ultimately helpful to Traumatic Brain Injury (TBI) patients, because the recommended software will support them in undertaking activities and help prevent them from making errors (McPherson, Kayes, & Weatherall, 2009).

Six participants from different age groups were chosen as three couples, each comprising both genders. The types of speech examined were computer-generated voice (CV), natural voice (NV), and familiar voice (FV). The synthetic voices were generated by computer software, the natural speech samples were provided by two native speakers of New Zealand English, and the familiar voices for each couple were simply recordings of each other’s voices. Participants completed a paper-and-pencil self-perception of task performance scale three times, once after each listening test, and were then interviewed. The evaluative data were used to inform the participants and the researcher about the study and to guide the interview process. The methods were largely qualitative: semi-structured interviews explored users’ perception of the manner of speaking and of the speaker in the three examined speech samples, and investigated the importance of the voice characteristics used. The interviews were analysed to discover themes and patterns related to an analysis framework derived from the literature review.

The findings revealed differences between the three couples in their perceptions of the different types of speech. A slight gender effect was present, as subjects showed a more positive attitude towards the opposite gender. Both human voices, NV and FV, were acceptable to the majority of participants, with many reporting improved mood and goal attainment. Participants found working with CV both challenging and rewarding. NV seemed particularly helpful in engaging people in the task process, while FV appeared particularly helpful in providing a structured framework for error prevention in attempting goal performance.

Bibliographic record

  • Author

    Wadea Mazen;

  • Author affiliation
  • Year 2011
  • Total pages
  • Original format PDF
  • Language of text en
  • Chinese Library Classification
