首页> 外文期刊>Knowledge-Based Systems >A context-aware embeddings supported method to extract a fuzzy sentiment polarity dictionary
【24h】

A context-aware embeddings supported method to extract a fuzzy sentiment polarity dictionary

机译:支持上下文感知的嵌入提取模糊情感极性字典的方法

获取原文
获取原文并翻译 | 示例
           

摘要

The latest development in cognitive technologies are helping us understand emotions and sentiments with unprecedented precision. Polarity detection is the key enabler to sentiment analysis and typically relies on experimental dictionaries, where terms are assigned polarity scores, yet lacking contextual information and based on human inputs and conventions. In this article, we present a novel approach to automatically extract a polarity dictionary from a particular domain, the stock market, without human intervention and addressing the scaling and thresholding problem. Our approach tracks the price changes of particular stocks over time, using it as a guiding polarity value. The magnitude of the price variation for a particular stock is then attributed to the financial news about this stock in corresponding period of time and that is what we use as our working corpus. On top of that, we derive the so-called binned corpus and apply the well-known TF-IDF information retrieval techniques to compute the TF-IDF value for each term. These values are then disseminated within the neighbourhood of each term based on the embeddings-enabled cosine distance. After introducing the problem and providing the background information, we thoroughly describe our method and all the components required to implement the system. Last but not least, we assign the terms to fuzzy linguistic labels and provide a volatility metric indicating how reliable our scores are depending on their distribution of occurrences in the corpus. To show how our approach works, we implement it for the Euro Stoxx 50 from January 2018 to March 2019 and discuss the results compared with typical approaches, pointing out potential improvements for further research work. (C) 2019 Elsevier B.V. All rights reserved.
机译:认知技术的最新发展正在帮助我们以前所未有的精确度来理解情绪和情感。极性检测是情感分析的关键推动力,通常依赖于实验词典,在该词典中为术语分配了极性分数,但缺少上下文信息并且基于人工输入和约定。在本文中,我们提出了一种新颖的方法来自动从特定领域(股票市场)提取极性字典,而无需人工干预并解决缩放和阈值问题。我们的方法将特定股票的价格随时间变化,以此作为指导极性值。然后,将特定股票的价格变动幅度归因于相应时间段内有关该股票的财经新闻,这就是我们用作工作语料库的时间。最重要的是,我们得出了所谓的合并语料库,并应用了众所周知的TF-IDF信息检索技术来计算每个术语的TF-IDF值。然后,基于嵌入启用的余弦距离,将这些值散布在每个项的附近。在介绍了问题并提供了背景信息之后,我们彻底描述了我们的方法以及实现该系统所需的所有组件。最后但并非最不重要的一点是,我们将这些术语分配给模糊的语言标签,并提供一个波动率度量,该度量表明我们的分数在语料库中的分布取决于其得分的可靠性。为了展示我们的方法是如何工作的,我们从2018年1月至2019年3月在Euro Stoxx 50上实施了该方法,并将结果与​​典型方法进行了讨论,指出了可能进行进一步研究的改进方法。 (C)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号