Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus

Jean Carletta

Abstract

The AMI Meeting Corpus contains 100 h of meetings captured using many synchronized recording devices, and is designed to support work in speech and video processing, language engineering, corpus linguistics, and organizational psychology. It has been transcribed orthographically, with annotated subsets for everything from named entities, dialogue acts, and summaries to simple gaze and head movement. In this written version of an LREC conference keynote address, I describe the data and how it was created. If this is "killer" data, that presupposes a platform that it will "sell"; in this case, that is the NITE XML Toolkit, which allows a distributed set of users to create, store, browse, and search annotations for the same base data that are both time-aligned against signal and related to each other structurally.


Bibliographic record

Source: Language Resources and Evaluation, 2007, Issue 2, pp. 181-190 (10 pages)
Author: Jean Carletta
Original format: PDF
Language: English
Chinese Library Classification: Computing technology, computer technology

Similar literature

1. Creating a live, public short message service corpus: the NUS SMS corpus [J]. Tao Chen, Min-Yen Kan. Language Resources and Evaluation, 2013, Issue 2.
2. Volcanic experiences: comparing non-corpus-based translations with corpus-based translations in translation training [J]. Giampieri Patrizia. Perspectives: studies in translatology, 2021, Issue 1a2.
3. Annotation and Recognition of Personality Traits in Spoken Conversations from the AMI Meetings Corpus [C]. Fabio Valente, Samuel Kim, Petr Motlicek. Annual conference of the International Speech Communication Association, 2012.
4. Automatic extractive summarization on meeting corpus [D]. Xie, Shasha. 2010.
5. Musculoskeletal Fitness Measures Are Not Created Equal: An Assessment of School Children in Corpus Christi Texas [O]. Toyin Ajisafe, Theresa Garcia, Hsin-Chen Fanchiang. 2018.
6. Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus [O]. Jean Carletta. 2014.
