首页> 外文会议>WRI World Congress on Computer Science and Information Engineering >Chinese Verb Subcategorization Acquisition from Noisy Data on Sentence Level
【24h】

Chinese Verb Subcategorization Acquisition from Noisy Data on Sentence Level

机译:从句子级别的嘈杂数据中获取汉语动词分类

获取原文

摘要

Subcategorization is the process that further classifies a syntactic category into its subsets. Aiming to improve the recall of acquisition, we design an automatic approach of enriching the argument knowledge of SCF by means of active learning and employing a multi-class SVM model to classify argument type. We could thus give an accurate SCF as output for each input sentence, even on noisy data, meanwhile avoiding writing rules by hand. Our approach generates hypothesis directly without statistical filtering as the next step after generation. Experiments results indicate that the acquisition performance is significantly improved especially in the aspect of recall, which was increased from 88.83 to 99.75 in open test.
机译:子类别是进一步将语法类别分类为其子集的过程。旨在提高收购的回忆,我们通过主动学习设计了丰富SCF的参数知识的自动方法,并采用多级SVM模型来分类参数类型。因此,即使在嘈杂的数据上,我们也可以为每个输入句提供准确的SCF作为每个输入句的输出,同时避免手动编写规则。我们的方法直接产生假设,没有统计过滤作为生成后的下一步。实验结果表明,特别是在召回方面显着改善了采集性能,从开放试验中增加到88.83至99.75。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号