A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents

机译：通过理解和构造中文口语文献的多媒体档案多层汇总系统

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The multi-media archives are very difficult to be shown on the screen, and very difficult to retrieve and browse. It is therefore important to develop technologies to summarize the entire archives in the network content to help the user in browsing and retrieval. In a recent paper [1] we proposed a complete set of multi-layered technologies to handle at least some of the above issues: (1) Automatic Generation of Titles and Summaries for each of the spoken documents, such that the spoken documents become much more easier to browse, (2) Global Semantic Structuring of the entire spoken document archive, offering to the user a global picture of the semantic structure of the archive, and (3) Query-based Local Semantic Structuring for the subset of the spoken documents retrieved by the user's query, providing the user the detailed semantic structure of the relevant spoken documents given the query he entered. The Probabilistic Latent Semantic Analysis (PLSA) is found to be helpful. This paper presents an initial prototype system for Chinese archives with the functions mentioned above, in which the broadcast news archive in Mandarin Chinese is taken as the example archive.

机译：多媒体档案很难在屏幕上显示，也很难检索和浏览。因此，重要的是开发技术来总结网络内容中的整个档案，以帮助用户浏览和检索。在最近的一篇论文中[1]，我们提出了一套完整的多层技术来处理至少一些上述问题：（1）自动为每个语音文档生成标题和摘要，从而使语音文档变得越来越多更易于浏览，（2）整个语音文档档案的全局语义结构，向用户提供档案语义结构的全局图片，以及（3）语音文档子集的基于查询的局部语义结构通过用户的查询检索，从而根据用户输入的查询向用户提供相关口头文件的详细语义结构。发现概率潜在语义分析（PLSA）是有帮助的。本文提出了具有上述功能的中文档案的初始原型系统，以普通话广播新闻档案为例。

著录项

来源
《Chinese Spoken Language Processing; Lecture Notes in Artificial Intelligence; 4274》|2006年|683-692|共10页
会议地点 Singapore(SG)
作者
Lin-shan Lee; Sheng-yi Kong; Yi-cheng Pan; Yi-sheng Fu; Yu-tsun Huang; Chien-Chih Wang;
展开▼
作者单位

Speech Lab, College of EECS National Taiwan University, Taipei;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序语言、算法语言;
关键词

相似文献

外文文献
中文文献
专利

1. A Comparative Study of Probabilistic Ranking Models for Chinese Spoken Document Summarization [J] . SHIH-HSIANG LIN, BERLIN CHEN, HSIN-MIN WANG ACM transactions on Asian language information processing . 2009,第1期

机译：汉语口语文摘概率等级模型的比较研究
2. Extractive spoken document summarization for information retrieval [J] . Berlin Chen, Yi-Ting Chen Pattern recognition letters . 2008,第4期

机译：提取语音文档摘要以进行信息检索
3. The user model-based summarize and refine approach improves information presentation in spoken dialog systems [J] . Andi K. Winterboer, Martin I. Tietze, Maria K. Wolters, Computer speech and language . 2011,第2期

机译：基于用户模型的汇总和细化方法改善了口语对话系统中的信息表示
4. A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents [C] . Lin-shan Lee, Sheng-yi Kong, Yi-cheng Pan, International Symposium on Chinese Spoken Language Processing . 2006

机译：通过了解和构建中文文献的多媒体档案的多层摘要系统
5. Summarizing spoken documents through utterance selection. [D] . Zhu, Xiaodan. 2010

机译：通过语音选择来总结口头文件。
6. Structure diagnostics of heterostructures and multi-layered systems by X-ray multiple diffraction [O] . Mariana Borcha, Igor Fodchuk, Mykola Solodkyi, -1

机译：X射线多重衍射对异质结构和多层系统的结构诊断
7. CHINESE SPOKEN DOCUMENT SUMMARIZATION USING PROBABILISTIC LATENT TOPICAL INFORMATION [O] . Berlin Chen, Yao-ming Yeh, Yao-min Huang, 2013

机译：使用概率潜在最新信息的中文口语摘要
8. Real-Time Spoken-Language System for Interactive Problem-Solving, Combining Linguistic and Statistical Technology for Improved Spoken Language Understanding. [R] . Moore, R. C., Cohen, M. H. 1993

机译：交互式问题解决的实时语言系统，结合语言和统计技术提高口语理解能力。

A Multi-layered Summarization System for Multi-media Archives by Understanding and Structuring of Chinese Spoken Documents

摘要

著录项

相似文献

相关主题

期刊订阅