Speech-to-Text Translation for Indexing and Searching of Audio/Visual Materials for a Digital Library


Abstract

Vast amounts of data exist in the form of audio/video tapes. For this data to be useful to digital libraries, it must be cataloged, indexed, and made easily searchable. In many cases the speech information contained in the audio track is of major interest. For example, a user might want to make a query of the form: find all instances where the phrases "balanced budget" and "tax cuts" are mentioned within 30 seconds of each other. Or a user may want to search based on both the spoken material and broad characteristics of the visual content. For example, a user may ask to find examples where the words "end of the cold war" are combined with visual clips of missiles. To make such queries feasible, a database must be properly labeled as to audio and video content, indexed, and integrated with appropriate database search tools. Software tools to prepare the audio/video content are now becoming available. This paper reports results on the accuracy of the automatic speech recognition components of two of the leading software tools for creating such indexed digital libraries.
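The "within 30 seconds of each other" query can be illustrated with a minimal sketch. It assumes the speech recognizer produces a transcript as a list of (word, start time in seconds) pairs; that format, the function names, and the sample data are hypothetical and are not taken from the paper or from either of the tools it evaluates.

```python
from typing import List, Tuple

# Assumed transcript format: (word, start_time_in_seconds) pairs.
Transcript = List[Tuple[str, float]]


def phrase_times(transcript: Transcript, phrase: str) -> List[float]:
    """Return the start time of every occurrence of `phrase` (case-insensitive)."""
    target = phrase.lower().split()
    if not target:
        return []
    words = [w.lower() for w, _ in transcript]
    n = len(target)
    return [
        transcript[i][1]
        for i in range(len(words) - n + 1)
        if words[i:i + n] == target
    ]


def co_occurrences(
    transcript: Transcript, phrase_a: str, phrase_b: str, window: float = 30.0
) -> List[Tuple[float, float]]:
    """Return (time_a, time_b) pairs whose start times differ by at most `window` seconds."""
    times_a = phrase_times(transcript, phrase_a)
    times_b = phrase_times(transcript, phrase_b)
    return [(ta, tb) for ta in times_a for tb in times_b if abs(ta - tb) <= window]


# Made-up sample transcript for illustration only.
transcript = [
    ("we", 0.0), ("need", 0.4), ("a", 0.7), ("balanced", 0.9), ("budget", 1.5),
    ("and", 12.0), ("tax", 12.3), ("cuts", 12.7),
    ("later", 90.0), ("tax", 90.5), ("cuts", 91.0),
]

hits = co_occurrences(transcript, "balanced budget", "tax cuts")
print(hits)
```

The sketch matches exact words only, so any recognition error in the transcript causes a missed hit. This is why the accuracy of the speech recognition component matters for this kind of search.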


Bibliographic details

Source: World Multiconference on Systemics, Cybernetics and Informatics; SCI 2000, 2000, pp. 415-417 (3 pages)
Authors: Stephen A. Zahorian; Sudarshan Patilkulkarni; Montri Karnjanadecha; Charles Brewton
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

