首页> 外文会议>World multiconference on systemics, cybernetics and informatics;SCI 2000 >Speech-to-Text Translation for Indexing and Searching of Audio/Visual Materials for a Digital Library,
【24h】

Speech-to-Text Translation for Indexing and Searching of Audio/Visual Materials for a Digital Library,

机译:语音到文本翻译,用于索引和搜索数字图书馆的音频/视频材料,

获取原文

摘要

Vast amounts of data exist in the form of audio/video tapes. In order for this data to be useful for digital libraries, it must be cataloged, indexed, and made easily searchable. In many cases the speech information contained in the audio track is of major interest. For example, a user might want to make a query of the form―find all instances where the words "balanced budget" and "tax cuts" are mentioned within 30 seconds of each other. Or a user may want to make a search based on both spoken materials and broad characteristics of the visual content. For example, a user make ask to find examples with the words "end of the cold war " are combined with visual clips of missiles. In order to make such queries feasible, a database must be properly labeled as to audio and video content, indexed, and integrated with appropriate database search tools. Software tools to prepare the audio/video content are now becoming available. In this paper, results are given as to the accuracy of the automatic speech recognition components of two of the leading software tools for creating such indexed digital libraries.
机译:大量数据以音频/录像带的形式存在。为了使此数据对数字图书馆有用,必须对其进行分类,索引并使其易于搜索。在许多情况下,音轨中包含的语音信息引起了人们的极大兴趣。例如,用户可能想查询以下形式:查找所有在彼此之间相隔30秒之内提到“平衡预算”和“减税”一词的实例。或者,用户可能希望同时基于口头材料和视觉内容的广泛特征进行搜索。例如,用户要求寻找单词“冷战结束”与导弹的可视剪辑相结合的示例。为了使这种查询可行,必须对音频和视频内容进行适当的标签标记,索引并与适当的数据库搜索工具集成。准备音频/视频内容的软件工具现已可用。在本文中,给出了用于创建这样的索引数字库的两个领先软件工具的自动语音识别组件的准确性的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号