首页> 外文会议>First International Workshop on Pattern Recognition with Support Vector Machines SVM 2002, Aug 10, 2002, Niagara Falls, Canada >Support Vector Learning for Gender Classification Using Audio and Visual Cues: A Comparison
【24h】

Support Vector Learning for Gender Classification Using Audio and Visual Cues: A Comparison

机译:支持向量学习对使用音频和视频提示进行性别分类的比较

获取原文
获取原文并翻译 | 示例

摘要

Computer vision systems for monitoring people and collecting valuable demographics in a social environment will play an increasingly important role in enhancing user's experience and can significantly improve the intelligibility of a human computer interaction (HCI) system. For example, a robust gender classification system is expected to provide a basis for passive surveillance and access to a smart building using demographic information or can provide valuable consumer statistics in a public place. The option of an audio cue in addition to the visual cue promises a robust solution with high accuracy and ease-of-use in human computer interaction systems. This paper investigates the use of Support Vector Machines(SVMs) for the purpose of gender classification. Both visual (thumbnail frontal face) and audio (features from speech data) cues were considered for designing the classifier and the performance obtained by using each cue was compared. The performance of the SVM was compared with that of two simple classifiers namely, the nearest prototype neighbor and the k-nearest neighbor on all feature sets. It was found that the SVM outperformed the other two classifiers on all datasets. The best overall classification rates obtained using the SVM for the visual and speech data were 95.31% and 100%, respectively.
机译:在社会环境中用于监视人员并收集有价值的人口统计信息的计算机视觉系统在增强用户体验方面将扮演越来越重要的角色,并且可以显着提高人机交互(HCI)系统的清晰度。例如,一个强大的性别分类系统有望为使用人口统计信息进行被动监视和访问智能建筑提供基础,或者可以在公共场所提供有价值的消费者统计信息。除视觉提示外,还可以选择音频提示,从而保证了一种可靠的解决方案,具有很高的准确性,并且在人机交互系统中易于使用。本文研究了用于性别分类的支持向量机(SVM)的使用。视觉(缩略图正面)和音频(来自语音数据的特征)提示都被考虑用于设计分类器,并比较了使用每个提示获得的性能。将SVM的性能与两个简单分类器的性能进行了比较,即所有特征集上最近的原型邻居和k最近邻居。发现在所有数据集上,SVM均优于其他两个分类器。使用SVM获得的视觉和语音数据的最佳总体分类率分别为95.31%和100%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号