首页> 外文会议>International Conference on speech and computer >Detection of Stance and Sentiment Modifiers in Political Blogs
【24h】

Detection of Stance and Sentiment Modifiers in Political Blogs

机译:政治博客中姿态和情感修饰语的检测

获取原文

摘要

The automatic detection of seven types of modifiers was studied: Certainty, Uncertainty, Hypotheticality, Prediction, Recommendation, Concession/Contrast and Source. A classifier aimed at detecting local cue words that signal the categories was the most successful method for five of the categories. For Prediction and Hypotheticality, however, better results were obtained with a classifier trained on tokens and bigrams present in the entire sentence. Unsupervised cluster features were shown useful for the categories Source and Uncertainty, when a subset of the training data available was used. However, when all of the 2,095 sentences that had been actively selected and manually annotated were used as training data, the cluster features had a very limited effect. Some of the classification errors made by the models would be possible to avoid by extending the training data set, while other features and feature representations, as well as the incorporation of pragmatic knowledge, would be required for other error types.
机译:研究了七种修饰符的自动检测:确定性,不确定性,假设性,预测,推荐,让步/对比和来源。旨在检测表示类别的本地提示词的分类器是其中五个类别中最成功的方法。但是,对于预测和假设性,使用经过整句中存在的标记和二元组训练的分类器可以获得更好的结果。当使用可用训练数据的子集时,显示无监督的聚类特征对于“来源”和“不确定性”类别很有用。但是,当所有已经被主动选择并手动注释的2,095个句子都用作训练数据时,聚类特征的作用非常有限。通过扩展训练数据集,可以避免模型造成的一些分类错误,而其他错误类型则需要其他特征和特征表示以及务实知识的整合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号