首页> 外文学位 >Information fusion for monolingual and cross-language spoken document retrieval.
【24h】

Information fusion for monolingual and cross-language spoken document retrieval.

机译:用于单语言和跨语言语音文档检索的信息融合。

获取原文
获取原文并翻译 | 示例

摘要

Spoken document retrieval (SDR) is an important technique that enables relevant information to be searched from spoken data archives. With the advent of Internet and multimedia technologies, the amount of available information sources in various media (text, audio, video etc.) is increasing very quickly. In order to make these available sources of information more readily usable, there are two major issues worth of investigations. First of all, since there is large quantity of information stored in audio format in addition to textual format, capability of searching among spoken data will allow one to make use of this large pool of data resources. Furthermore, the available information can also be presented in different languages. To handle this problem, information searching across different languages is also important.; In an information retrieval system, information is usually extracted by matching the indexing units at different scales. These scales include words as well as subwords (e.g. phonemes, characters or syllables). Among these indexing units, words are known to achieve higher precision in retrieval whilst subwords are more robust to errors within documents (e.g. recognition errors). In order to take advantage of these indexing units at multiple scales, we propose to apply information fusion to these units by multi-scale retrieval. Multi-scale retrieval refers to the use of both word and subword units for retrieval.; In this thesis, we shall present thorough investigations for both monolingual and cross-language spoken document retrieval tasks using the proposed multi-scale approach. We have investigated the multi-scale retrieval performances of the two most commonly used retrieval models, vector space model (VSM) and HMM-based model. The experiments are carried out using both monolingual Chinese (both Cantonese and Mandarin) and cross-language (English searching Mandarin) data archives. Specifically, in order to achieve multi-scale SDR with the HMM-based model, we have taken advantage of the probabilistic nature of this retrieval model to extend it for both cross-language information retrieval and subword scale retrieval. Furthermore, the success of multi-scale retrieval encourages us to extend this approach to multi-model as well as multi-scale and multi-model retrieval for SDR. Experiments on both monolingual and cross-language SDR tasks have demonstrated that the benefits of using these information fusion approaches are remarkable.
机译:语音文档检索(SDR)是一项重要技术,可从语音数据存档中搜索相关信息。随着Internet和多媒体技术的出现,各种媒体(文本,音频,视频等)中可用信息源的数量正在迅速增加。为了使这些可用的信息源更易于使用,有两个主要的问题值得研究。首先,由于除了文本格式外,还有大量以音频格式存储的信息,因此在语音数据中进行搜索的功能将使人们能够利用这一庞大的数据资源池。此外,可用信息也可以用不同的语言显示。为了解决这个问题,跨不同语言的信息搜索也很重要。在信息检索系统中,通常通过匹配不同比例的索引单元来提取信息。这些音阶包括单词和子单词(例如音素,字符或音节)。在这些索引单元中,已知单词在检索中实现更高的精度,而子单词对文档中的错误(例如,识别错误)更健壮。为了利用这些索引单位在多个尺度上的优势,我们建议通过多尺度检索将信息融合应用于这些单位。多尺度检索是指同时使用词和子词单元进行检索。在本文中,我们将使用拟议的多尺度方法对单语和跨语言的语音文档检索任务进行全面研究。我们研究了两种最常用的检索模型(向量空间模型(VSM)和基于HMM的模型)的多尺度检索性能。使用单语的中文(广东话和普通话)和跨语言(英文搜索的普通话)数据档案库进行实验。具体来说,为了使用基于HMM的模型实现多尺度SDR,我们利用了这种检索模型的概率性质,将其扩展为跨语言信息检索和子词规模检索。此外,多尺度检索的成功鼓励我们将这种方法扩展到SDR的多模型以及多尺度和多模型检索。单语言和跨语言SDR任务的实验表明,使用这些信息融合方法的好处非常明显。

著录项

  • 作者

    Lo, Wai-Kit.;

  • 作者单位

    Chinese University of Hong Kong (People's Republic of China).;

  • 授予单位 Chinese University of Hong Kong (People's Republic of China).;
  • 学科 Engineering Electronics and Electrical.; Computer Science.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 184 p.
  • 总页数 184
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号