International Conference on Artificial Intelligence; ICAI 2011

Taxonomy and Evaluation of Markers for Computational Stylistics



Abstract

Stylistic analysis of natural-language texts is currently carried out through a wide variety of techniques that involve many different algorithms, feature sets, and collection methods. Most machine-learning methods rely on feature extraction to model the text and perform classification. But what are the best features for making style-based distinctions? While many researchers have developed particular collections of style features (called style markers), no definitive list exists. In this paper we present an organized collection of such style markers with performance data on a diverse set of texts. We show that for each training document, one or more markers exist that can distinguish it from others, providing a basis for a weighted, combined set of markers that outperforms any of the individual ones. We examine and categorize 502 style markers, both individually and as a set, and evaluate their performance on several English-language text collections.
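To make the idea of a style marker concrete, the following is a minimal Python sketch of extracting a few markers from a text and comparing two texts with a weighted combination. The three markers (average word length, average sentence length, type-token ratio), the function names `style_markers` and `weighted_distance`, and the weights are illustrative assumptions. They are not taken from the paper's set of 502 markers and do not reproduce its weighting scheme.

```python
import re


def style_markers(text):
    """Compute three illustrative style markers (assumed, not from the paper)."""
    # Words: runs of letters and apostrophes, lowercased.
    words = re.findall(r"[A-Za-z']+", text.lower())
    # Sentences: split on terminal punctuation and drop empty pieces.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    n_words = len(words) or 1          # avoid division by zero on empty input
    n_sentences = len(sentences) or 1
    return {
        "avg_word_length": sum(len(w) for w in words) / n_words,
        "avg_sentence_length": len(words) / n_sentences,
        "type_token_ratio": len(set(words)) / n_words,
    }


def weighted_distance(a, b, weights):
    """Weighted sum of absolute marker differences between two texts."""
    return sum(weights[name] * abs(a[name] - b[name]) for name in weights)


# Hypothetical weights; in practice they would be tuned on training documents.
WEIGHTS = {
    "avg_word_length": 1.0,
    "avg_sentence_length": 0.5,
    "type_token_ratio": 2.0,
}

if __name__ == "__main__":
    doc_a = style_markers("The cat sat. The dog ran.")
    doc_b = style_markers("Notwithstanding considerable objections, deliberations continued.")
    print(doc_a)
    print(weighted_distance(doc_a, doc_b, WEIGHTS))
```

A smaller distance suggests more similar style under the chosen markers. Raising the weight of a marker that separates a given document well is one simple way a combined set could do better than any single marker.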
