首页> 外文会议>International conference on advances in speech and language technologies for Iberian languages >Detection of Publicity Mentions in Broadcast Radio: Preliminary Results
【24h】

Detection of Publicity Mentions in Broadcast Radio: Preliminary Results

机译:广播电台宣传中的检测:初步结果

获取原文

摘要

The advertising mentions are publicity contents that are not prerecorded, usually are said by radio or TV broadcasters to publicize a product or a company. The main difficulty of detecting advertising mentions is that the audio is not exactly repeated every time, as happens with conventional prerecorded advertising where more efficient techniques such as audio fingerprinting can be used. This paper proposes the use of a keyword search system in Spanish for the detection of advertising mentions. For that, it has been necessary to train and evaluate a new speech recognizer in Spanish (LVCSR) using the Kaldi tool and databases Fisher Spanish and Callhome Spanish. The best word error rate we have obtained on conversational telephone speech is 41.10 %. For the evaluation of mentions detection a specific database in Spanish has been created, containing 300 h of audio, 25 of which have been tagged with different types of information, including mentions appearing in the audio. The recognizer has been applied to all advertising mentions in search for mention specific keywords, achieving a detection rate of about 74 %.
机译:广告中提及的内容是未经预先录制的宣传内容,通常由广播电台或电视台说来宣传产品或公司。检测广告提及的主要困难在于,并非每次都能准确重复音频,就像传统的预录制广告一样,在这种情况下,可以使用更有效的技术,例如音频指纹识别。本文建议使用西班牙语的关键字搜索系统来检测广告内容。为此,有必要使用Kaldi工具和数据库Fisher Spanish和Callhome Spanish训练和评估新的西班牙语语音识别器(LVCSR)。我们在对话电话语音中获得的最佳单词错误率是41.10%。为了评估提及检测,已创建了一个西班牙语专用数据库,其中包含300个小时的音频,其中25个已用不同类型的信息标记,包括音频中出现的提及。该识别器已应用于搜索特定提及关键字的所有广告提及中,实现了约74%的检测率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号