Domain dependence of statistical named entity recognition and classification in Croatian texts

机译：统计名为实体识别与克罗地亚文本分类的域依赖性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Influence of text domain selection on statistical named entity recognition and classification in Croatian texts is investigated. Two datasets of Croatian newspaper texts of differing text domains were manually annotated for named entities and used for training and testing the Stanford NER system for named entity recognition based on sequence labeling with CRF. State of the art scores were observed in both domains. A strong preference for systems trained on mixed text domains is established by the experiment. The top-performing system was recorded with an overall F1-score of 0.876 on mixed-domain test sets, scoring 0.899 in one of the selected domains and 0.852 in the other. The single best domain F1-scores were recorded at 0.910 and 0.858.

机译：调查了文本域选择对克罗地亚文本统计名称实体识别和分类的影响。用于命名实体的两个克罗地亚报纸文本的两个数据集被手动注释，用于命名实体，用于基于CRF的序列标记的命名实体识别训练和测试STANFORD NER系统。在两个域中观察到最先进的评分。实验建立了对混合文本域训练的系统的强烈偏好。在混合域试验组上记录了顶级性能系统，总体F1分数为0.876，在其中一个选定的域中进行0.899，另一个域中的0.852。单个最佳域F1分数记录在0.910和0.858。

著录项

来源
《International Conference on Information Technology Interfaces》|2013年||共6页
会议地点
作者
Agic Zeljko; Bekavac Bozo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G202-53;
关键词
Croatian language; domain dependence; named entity recognition; text domain;

机译：克罗地亚语;域依赖;命名实体识别;文本域;

相似文献

外文文献
中文文献
专利

1. Domain-aware Evaluation of Named Entity Recognition Systems for Croatian [J] . Agic Zeljko, Bekavac Bozo Journal of computing and information technology . 2013,第3期

机译：克罗地亚命名实体识别系统的域感知评估
2. Domain-aware Evaluation of Named Entity Recognition Systems for Croatian [J] . Zeljko Agic, Bozo Bekavac Journal of Computing and Information Technology . 2013,第3期

机译：克罗地亚命名实体识别系统的域感知评估
3. Named entity recognition and classification in biomedical text using classifier ensemble [J] . Saha Sriparna, Ekbal Asif, Sikdar Utpal Kumar International journal of data mining and bioinformatics . 2015,第4期

机译：使用分类器集成在生物医学文本中命名实体识别和分类
4. Domain dependence of statistical named entity recognition and classification in Croatian texts [C] . Agic Zeljko, Bekavac Bozo 35th International Conference on Information Technology Interfaces : Research and Education using Mobile and Social Networking: When, Where, and How . 2013

机译：克罗地亚语文本中统计命名实体的识别和分类的域依赖性
5. A data-intensive approach to named entity recognition using domain and language independent methods [D] . Osesina, Olukayode Isaac. 2010

机译：使用领域和语言无关的方法进行的数据密集型命名实体识别方法
6. De-identifying Spanish medical texts - named entity recognition applied to radiology reports [O] . Irene Pérez-Díez, Raúl Pérez-Moraga, Adolfo López-Cerdán, 2021

机译：去识别西班牙医学文本 - 命名实体识别适用于放射学报告
7. Domain-aware Evaluation of Named Entity Recognition Systems for Croatian [O] . Agic, Zeljko, Bekavac, Bozo 2013

机译：克罗地亚命名实体识别系统的域感知评估

Domain dependence of statistical named entity recognition and classification in Croatian texts

摘要

著录项

相似文献

相关主题

期刊订阅