An Efficient Tool for Building a Large Part-Of-Speech Annotated Corpus

机译：一个有效的工具，用于构建大型语音注释的语料库

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Large part-of-speech(pos) annotated corpus play an important role in many kinds of natural language processing. So, the annotated corpus requires very high accuracy and consistency. To build such accurate and consistent corpus, we often use manual tagging. But the manual tagging is very labor intensive and expensive. Furthermore, it is not easy to get consistent results from the human experts. The goal of this work is to develope an efficient tool for building accurate and a consistent pos annotated corpus with minimal human labor. The developed tool can help minimize the amount of the human labor and make the results consistent by using lexical rules. The lexical rules are acquired from human experts in the similar way of manual tagging and manual error correction. They are used to annotate the same word in the same context in the whole corpus.

机译：大部分演讲（POS）注释语料库在多种自然语言处理中发挥着重要作用。因此，注释的语料库需要非常高的精度和一致性。要构建如此准确和一致的语料库，我们经常使用手动标记。但手动标记非常劳动密集且昂贵。此外，不容易获得人类专家的一致结果。这项工作的目标是开发一个有效的工具，用于建立准确的准确性和一致的POS注释的语料库，具有最小的人工劳动力。开发的工具可以帮助最小化人工劳动量，并通过使用词汇规则使结果一致。词汇规则是以手动标记和手动纠错的类似方式从人类专家获取。它们用于在整个语料库中的同一上下文中注释相同的单词。

著录项

来源
《International conference on artificial intelligence》|2000年||共5页
会议地点
作者
Hae-Chang Rim; Heui-Seok Lim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
part-of-speech; corpus; annotated corpus; minimal human labor;

机译：词语部分;语料库;注释语料库;最小的人工劳动力;

相似文献

外文文献
中文文献
专利

1. Developing a corpus of clinical notes manually annotated for part-of-speech [J] . Serguei V. Pakhomov, Anni Coden, Christopher G. Chute International journal of medical informatics . 2006,第6期

机译：开发手动注释的词性的临床注释语料库
2. Building Arabic Corpus Applied to Part-of-Speech Tagging [J] . Rabab Ali Abumalloh, Hassan Maudi Al-Sarhan, Waheeb Abu-Ulbeh Indian Journal of Science and Technology . 2016,第46期

机译：构建应用于词性标记的阿拉伯语语料库
3. Building a Thai part-of-speech tagged corpus (ORCHID) [J] . Virach Sornlertlamvanich, Naoto Takahashi, Hitoshi Isahara, The Journal of the Acoustical Society of Japan . 1999,第3期

机译：建立泰语词性标记语料库（ORCHID）
4. An Efficient Tool for Building a Large Part-Of-Speech Annotated Corpus [C] . Hae-Chang Rim, Heui-Seok Lim International Conference on Artificial Intelligence IC-AI'2000 Vol.3, Jun 26-29, 2000, Las Vegas, Nevada, USA . 2000

机译：构建大型语音注释的语料库的有效工具
5. A Decision Support Tool for Designing Energy-Efficient Residential Buildings at the Early Planning and Design Stage [D] . Sun, Shilin. 2021

机译：在早期规划和设计阶段设计节能住宅建筑的决策支持工具
6. Building a semantically annotated corpus for chronic disease complications using two document types [O] . Noha Alnazzawi 2021

机译：使用两种文件类型构建用于慢性疾病并发症的语义注释的语料
7. Building Chinese Sense Annotated Corpus with the Help of Software Tools [O] . Yunfang Wu, Peng Jin, Tao Guo, 2009

机译：借助软件工具构建带有中文注释的语料库

An Efficient Tool for Building a Large Part-Of-Speech Annotated Corpus

摘要

著录项

相似文献

相关主题

期刊订阅