首页> 美国政府科技报告 >Categorial Variation Database for English
【24h】

Categorial Variation Database for English

机译:日语的分类变异数据库

获取原文

摘要

We describe our approach to the construction and evaluation of a large-scale database called 'CatVar' which contains categorial variations of English lexemes. Due to the prevalence of crosslanguage categorial variation in multilingual applications our categorial variation resource may serve as an integral part of a diverse range of natural language applications. Thus, the research reported herein overlaps heavily with that of the machine- translation, lexicon construction, and information-retrieval communities. We apply the information-retrieval metrics of precision and recall to evaluate the accuracy and coverage of our database with respect to a human-produced gold standard. This evaluation reveals that the categorical database achieves a high degree of precision and recall. Additionally, we demonstrate that the database improves on the linkability of Porter Stemmer by over 30/%.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号