【24h】

Taggers for Unknown Words using Decision Tree and Lazy Learning

机译:使用决策树和惰性学习对未知单词进行标记

获取原文
获取原文并翻译 | 示例

摘要

This paper describes methods of tagging unknown words using decision tree induction and lazy learnign as post processes of morphological analysis. Unknown words are words which cannot be looked up in a dictionary of a natural language processing system. These are often named entities such as the name of a person, organization, or location. And these words play an important role in the application of information extraction. In this paper, we apply automated taggers that learn from training data which is written by hand. The algorithms of the taggers are a decision tree generated by C4.5 and a lazy learning algorithm. Attributes used for accurately tagging are notation of unknown words and the part of speech of neighboring words, and tag sets we classify htese words are noun and some named entities. The experimental result of evaluating this method shows correct answers of 65
机译:本文介绍了使用决策树归纳和惰性学习作为词法分析的后处理方法来标记未知单词的方法。未知单词是无法在自然语言处理系统的词典中查找的单词。这些通常被命名为实体,例如人员,组织或位置的名称。这些词在信息提取的应用中起着重要的作用。在本文中,我们应用了自动标记器,该标记器从手工编写的训练数据中学习。标记器的算法是C4.5生成的决策树和惰性学习算法。用于准确标记的属性是未知单词的表示法和相邻单词的语音部分,我们将这些单词分类的标记集是名词和一些命名实体。评价该方法的实验结果显示正确答案为65

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号