首页> 外文期刊>ACM transactions on Asian language information processing >Novel Character Identification Utilizing Semantic Relation with Animate Nouns in Korean
【24h】

Novel Character Identification Utilizing Semantic Relation with Animate Nouns in Korean

机译:利用韩语动词与语义关系的新颖字符识别

获取原文
获取原文并翻译 | 示例
       

摘要

For identifying speakers of quoted speech or extracting social networks from literature, it is indispensable to extract character names and nominals. However, detecting proper nouns in the novels translated into or written in Korean is harder than in English because Korean does not have a capitalization feature. In addition, it is almost impossible for any proper noun dictionary to include all kinds of character names that have been created or will be created by authors. Fortunately, a previous study shows that utilizing postpositions for animate nouns is a simple and effective tool for character identification in Korean novels without a proper noun dictionary and a training corpus. In this article, we propose a character identification method utilizing the semantic relation with known animate nouns. For 80 novels in Korean, the proposed method increases the micro- and macro-average recall by 13.68% and 11.86%, respectively, while decreasing the micro-average precision by 0.28% and increasing the macro-average precision by 0.07% compared to the previous study. If we focus on characters that are responsible for more than 1% of the character name mentions in each novel, the micro- and macro-average F-measure of the proposed method are 96.98% and 97.32%, respectively.
机译:为了识别引用语音的说话者或从文学中提取社交网络,提取字符名称和名词是必不可少的。但是,由于韩语没有大写字母,因此要比用英语更难检测出用韩语翻译或写成的小说中的专有名词。另外,任何专有名词词典几乎都不可能包含已经创建或将由作者创建的各种字符名称。幸运的是,以前的研究表明,在没有适当的名词词典和训练语料库的情况下,利用后置位置设置动画名词是韩国小说中字符识别的一种简单有效的工具。在本文中,我们提出一种利用与已知有生命名词的语义关系的字符识别方法。对于朝鲜语中的80部小说,所提出的方法分别使微平均和宏观平均召回率分别提高了13.68%和11.86%,而微平均召回率降低了0.28%,宏观平均召回率提高了0.07%。以前的研究。如果我们关注的字符占每本小说中提到的字符名称的1%以上,那么该方法的微观平均和宏观平均F值分别为96.98%和97.32%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号