A Classical Chinese Corpus with Nested Part-of-Speech Tags

机译：一个古典的汉语语料库，嵌套部分 - 言论标签

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We introduce a corpus of classical Chinese poems that has been word segmented and tagged with parts-of-speech (POS). Due to the ill-defined concept of a 'word' in Chinese, previous Chinese corpora suffer from a lack of standardization in word segmentation, resulting in inconsistencies in POS tags, therefore hindering interoperability among corpora. We address this problem with nested POS tags, which accommodates different theories of wordhood and facilitates research objectives requiring annotations of the 'word' at different levels of granularity.

机译：我们介绍了一个古典诗歌的语料库，这是词分割的，并用演讲部分（POS）标记。由于中文中的“单词”概念，以前的中国语料库患有单词分割中缺乏标准化，导致POS标签不一致，因此在Corpora之间妨碍互操作性。我们通过嵌套的POS标签来解决这个问题，该标签可以容纳不同的措辞理论，并促进需要在不同粒度水平的“单词”注释的研究目标。

著录项

来源
《Conference of the European Chapter of the Association for Computational Linguistics》|2012年||共10页
会议地点
作者
John Lee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [J] . Ying Xiong, Zhongmin Wang, Dehuan Jiang, BMC Medical Informatics and Decision Making . 2019,第2期

机译：用于临床文本的细粒度中文分词和词性标注语料库
2. Corpus based part-of-speech tagging [J] . Chengyao Lv, Huihua Liu, Yuanxing Dong, International journal of speech technology . 2016,第3期

机译：基于语料库的词性标注
3. Building Arabic Corpus Applied to Part-of-Speech Tagging [J] . Rabab Ali Abumalloh, Hassan Maudi Al-Sarhan, Waheeb Abu-Ulbeh Indian Journal of Science and Technology . 2016,第46期

机译：构建应用于词性标记的阿拉伯语语料库
4. A Classical Chinese Corpus with Nested Part-of-Speech Tags [C] . John Lee Language Technology for Cultural Heritage, Social Sciences, and Humanities 2012 . 2012

机译：带有嵌套词性标签的中国古典语料库
5. IITagger: Tagging Wall Street Journal text with part-of-speech information [D] . Kim, Yeongkwun 1996

机译：IITagger：使用词性信息标记“华尔街日报”文本
6. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [O] . Ying Xiong, Zhongmin Wang, Dehuan Jiang, 2019

机译：用于临床文本的细粒度中文分词和词性标注语料库
7. A fine-grained Chinese word segmentation and part-of-speech tagging corpus for clinical text [O] . Ying Xiong, Zhongmin Wang, Dehuan Jiang, 2019

机译：临床文本的一个细粒度的汉语词分割和词语标记语料库

A Classical Chinese Corpus with Nested Part-of-Speech Tags

摘要

著录项

相似文献

相关主题

期刊订阅