首页> 外国专利> Personality data mining method using a speech based dialog

Personality data mining method using a speech based dialog

机译:使用基于语音的对话的个性数据挖掘方法

摘要

A method for collecting data associated with the voice of a voice system user includes conducting a conversation with the user, capturing and digitizing a speech waveform of the user, extracting at least one acoustic feature from the digitized speech waveform and storing attribute data corresponding to the acoustic feature, together with an identifying indicia, in the data warehouse in a form to facilitate subsequent data mining. User attributes can include gender, age, accent, native language, dialect, socioeconomic classification, educational level and emotional state. Data gathering can be repeated for a large number of users, until sufficient data is present. The attribute data to be stored can include raw acoustic features, or processed features, such as the user's emotional state, age, gender, socioeconomic group, and the like. In an alternative form of method, the user attribute can be used to real-time modify behavior of the voice system, with or without storage of data for subsequent data mining.;An apparatus for collecting data associated with a voice of a user includes a dialog management unit, an audio capture module, an acoustic front end, a processing module and a data warehouse. The acoustic front end receives and digitizes a speech waveform from the user and extracts at least one acoustic feature from the digitized speech waveform. The feature is correlated with at least one user attribute. The processing module analyzes the acoustic feature to determine the user attribute, which can then be stored in the data warehouse. The dialog management unit can include, for example, a telephone interactive voice response system. The processor can be an application specific circuit, a separate general purpose computer with appropriate software, or a processor portion of the IVR. The processing module can include an emotional state classifier, a speaker clusterer and classifier, a speech recognizer, and/or an accent identifier. Alternatively, the apparatus can be configured as a real-time-modifiable voice system for interaction with a user, which can be used to practice the method for tailoring a voice system response.
机译:一种用于收集与语音系统用户的语音相关联的数据的方法,包括:与用户进行对话;捕获并数字化用户的语音波形;从数字化语音波形中提取至少一个声学特征;以及存储与语音系统用户相对应的属性数据。数据仓库中的声音特征以及识别标记,以利于后续数据挖掘。用户属性可以包括性别,年龄,口音,母语,方言,社会经济分类,教育程度和情绪状态。可以为大量用户重复数据收集,直到出现足够的数据为止。要存储的属性数据可以包括原始声学特征或处理过的特征,例如用户的情绪状态,年龄,性别,社会经济群体等。在方法的替代形式中,用户属性可以用于实时修改语音系统的行为,无论是否存储用于后续数据挖掘的数据。一种用于收集与用户语音相关联的数据的设备包括:对话管理单元,音频捕获模块,声学前端,处理模块和数据仓库。声学前端接收来自用户的语音波形并将其数字化,并从数字化语音波形中提取至少一个声学特征。该特征与至少一个用户属性相关。处理模块分析声学特征以确定用户属性,然后可以将其存储在数据仓库中。对话管理单元可以包括例如电话交互式语音响应系统。处理器可以是专用电路,具有适当软件的独立通用计算机或IVR的处理器部分。处理模块可以包括情绪状态分类器,说话者聚类和分类器,语音识别器和/或口音标识符。可替代地,该设备可以被配置为用于与用户交互的实时可修改的语音系统,其可以用于实践用于定制语音系统响应的方法。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号