首页> 外文期刊>Procedia Computer Science >Effects of Word Class and Text Position in Sentiment-based News Classification
【24h】

Effects of Word Class and Text Position in Sentiment-based News Classification

机译:词类和文本位置在基于情感的新闻分类中的作用

获取原文
           

摘要

In news domain, sentiments captured in the form of sentiment labels (emoticon) give a quick feedback of reactions towards the contents of the news. As these reactions are valuable indicators for social and political well beings, we are motivated to automate the classification of news texts based on these indicators, e.g. happy, sad, angry, amused etc. Unlike other review texts that contain more explicit words which can be interpreted directly for sentiment classification, news texts mostly report facts and figures. This resulted in needs to identify whether contents of news can be exploited for classification or otherwise. Hence, in this work, a study is conducted to analyze and determine the relevant key parts of news contents that can be to be used for sentiment-based classification. Two criteria, i.e. text Part of Speech and text position, which could possible influence the training of the classifier are studied. Evaluations are conducted on the collection of 250 English news texts labelled with sentiments from sentiment voting system. The results for sentiment-based category has recorded F score of 0.422 whereas for polarity-based category has recorded F score of 0.837. The study has shown that when finer categories (e.g. happy, sad etc.) are used, the inspected criteria are less effectively; however, when these categories are based on polarity orientations, the outcomes show potentials of the proposed criteria especially for text positioned at headlines and text using adjective words.
机译:在新闻领域中,以情感标签(表情符号)形式捕获的情感可以快速反馈对新闻内容的反应。由于这些反应是社会和政治福祉的宝贵指标,因此我们有动机根据这些指标对新闻文本进行自动分类。快乐,悲伤,愤怒,逗乐等。与其他评论文本中包含更明确的词(可以直接将其用于情感分类)不同,新闻文本主要报道事实和数字。这导致需要确定新闻内容是否可以用于分类。因此,在这项工作中,进行了一项研究,以分析和确定可用于基于情感的分类的新闻内容的相关关键部分。研究了两个标准,即可能影响分类器训练的文本词性和词性。对从情感投票系统中标有情感的250种英语新闻文本的收集进行评估。基于情感的类别的结果已记录为0.422,而基于极性的类别的结果已记录为0.837。研究表明,当使用更精细的类别(例如,快乐,悲伤等)时,检查的标准不太有效;但是,当这些类别基于极性方向时,结果将显示出拟议标准的潜力,尤其是对于标题处的文本和使用形容词的文本而言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号