首页> 外文会议>International Conference on Document Analysis and Recognition >'The Godfather' vs. 'Chaos': Comparing Linguistic Analysis based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation
【24h】

'The Godfather' vs. 'Chaos': Comparing Linguistic Analysis based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation

机译:“教父”与“混乱”:比较基于在线知识来源的语言分析和电影审查价值估算的袋式袋

获取原文

摘要

In the fields of sentiment and emotion recognition, bag of words modeling has lately become popular for the estimation of valence in text. A typical application is the evaluation of reviews of e. g. movies, music, or games. In this respect we suggest the use of back-off N-Grams as basis for a vector space construction in order to combine advantages of word-order modeling and easy integration into potential acoustic feature vectors intended for spoken-document retrieval. For a fine granular estimate we consider data-driven regression next to classification based on Support Vector Machines. Alternatively the on-line knowledge sources ConceptNet, General Inquirer, and WordNet not only serve to reduce out-of-vocabulary events, but also as basis for a purely linguistic analysis. As special benefit, this approach does not demand labeled training data. A large set of 100k movie reviews of 20years stemming from Metacritic is utilized throughout extensive parameter discussion and comparative evaluation effectively demonstrating efficiency of the proposed methods.
机译:在情感和情感识别领域,袋子建模最近估计在文本中的价值。典型的应用是评估e的评估。 G。电影,音乐或游戏。在这方面,我们建议使用退避n-gram作为矢量空间结构的基础,以便将字阶建模和易于集成到旨在发言检索的潜在声学特征向量中的优点。对于细粒度估计,我们考虑基于支持向量机的分类旁边的数据驱动回归。或者,在线知识源概念,一般询问者和Wordnet不仅用于减少词汇流的事件,而且是纯语言分析的基础。作为特殊好处,这种方法不需要标记为培训数据。在整个参数讨论和比较评估中,利用了来自Metacitic的20年的大量100K电影评论,并有效地证明了所提出的方法的效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号