International Workshop on Semantic Evaluation
UCD: Diachronic Text Classification with Character, Word, and Syntactic N-grams

Abstract

We present our submission to SemEval-2015 Task 7: Diachronic Text Evaluation, in which we approach the task of assigning a date to a text as a multi-class classification problem. We extract n-gram features from the text at the letter, word, and syntactic level, and use these to train a classifier on date-labeled training data. We also incorporate date probabilities of syntactic features as estimated from a very large external corpus of books. Our system achieved the highest performance of all systems on subtask 2: identifying texts by specific time language use.
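
The pipeline the abstract outlines (n-gram extraction followed by multi-class classification over date labels) can be illustrated with a minimal sketch. The abstract does not state which classifier or n-gram orders the authors used, so everything specific here is an assumption: character trigrams, word unigrams and bigrams, and a multinomial naive Bayes classifier as a stand-in. The names `extract_features` and `NaiveBayesDater`, the date-range labels, and the toy training texts are hypothetical. Syntactic n-grams and the date probabilities from the external book corpus are left out, since they would need a parser and the corpus itself.

```python
import math
from collections import Counter, defaultdict


def char_ngrams(text, n):
    """Return all overlapping character n-grams of `text`."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return all overlapping word n-grams, using lowercase whitespace tokens."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def extract_features(text):
    """Count character trigrams and word uni/bigrams (orders are assumptions)."""
    feats = Counter()
    feats.update("c3:" + g for g in char_ngrams(text.lower(), 3))
    feats.update("w1:" + g for g in word_ngrams(text, 1))
    feats.update("w2:" + g for g in word_ngrams(text, 2))
    return feats


class NaiveBayesDater:
    """Multinomial naive Bayes with add-one smoothing.

    A stand-in classifier for illustration, not the authors' model.
    """

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            feats = extract_features(text)
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, text):
        feats = extract_features(text)
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # class prior
            for feat, k in feats.items():
                if feat in self.vocab:  # ignore features never seen in training
                    score += k * math.log((counts[feat] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy date-labeled data, invented purely for illustration.
train_texts = [
    "thou art most kind and thy words are fair",
    "whither goest thou this morn",
    "the telegraph office sent word by wire",
    "the steam engine and the telegraph changed travel",
]
train_labels = ["1600-1700", "1600-1700", "1800-1900", "1800-1900"]

model = NaiveBayesDater().fit(train_texts, train_labels)
prediction = model.predict("thou art fair")
print(prediction)
```

Prefixing each feature with its type (`c3:`, `w1:`, `w2:`) keeps character and word n-grams in separate namespaces, so additional feature families could be merged into the same count vector without collisions.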