首页> 外文会议>International Conference on Semantics, Knowledge and Grids >GubaLex: Guba-Oriented Sentiment Lexicon for Big Texts in Finance
【24h】

GubaLex: Guba-Oriented Sentiment Lexicon for Big Texts in Finance

机译:GubaLex:面向Guba的情感词汇,用于金融领域的大手笔

获取原文

摘要

Trading in stock market depends mostly on investor's emotions though technical analysis is a viable tool there. In China, Guba is a typical platform for individual investors to share news and opinions on their favorite stocks. The texts posted in Guba by investors involve in richful emotions which can reflect their willingness on the stock. Few works focus on Guba sentiment analysis though numerous have been done on investor sentiment analysis in finance market for the purpose of understanding the market. Text mining is the most popular method to analyze the sentiment implied in the web text, which depends heavily on the lexicon. Existed lexicons for general purpose work badly on sentiment analysis for Guba messages. In this work, we construct a specified lexicon for Chinese Guba, named GubaLex, in considerations of the characteristics of the Guba text: short, emotion enriched, colloquial (informal), and stock market oriented. It is constructed by using the merge of HowNet and NTUSD as the basic sentiment lexicon, then adding stock terms from the Guba corpus and information in the area of stock market. Based on GubaLex, we develop the bullish lexicon GLBull and the bearish lexicon GL-Bear especially including bullish and bearish sentiment terms for further sentiment analysis. We also proposed an auto update module and sentiment classification algorithm for Guba texts. The experiments show the proposed lexicon works better in sentiment analysis than the previous, like HowNet and NTUSD.
机译:股市交易主要取决于投资者的情绪,尽管技术分析是可行的工具。在中国,古巴是个人投资者分享自己喜欢的股票的新闻和观点的典型平台。投资者在古巴(Guba)发布的文本充满了丰富的情感,可以反映出他们对股票的意愿。尽管为了了解市场,对金融市场的投资者情绪分析进行了大量工作,但很少有文章专注于Guba情绪分析。文本挖掘是分析Web文本中包含的情感的最流行的方法,它在很大程度上取决于词典。用于通用目的的现有词典在Guba消息的情感分析上表现不佳。在这项工作中,我们考虑到古巴文字的特点:简短,情感丰富,口语化(非正式)和面向股票市场,为中国古巴构建了一个指定的词典,名为GubaLex。它是通过使用HowNet和NTUSD的合并作为基本情感词典来构造的,然后添加来自Guba语料库的股票条款和股票市场方面的信息。在GubaLex的基础上,我们开发了看涨词典GLBull和看跌词典GL-Bear,特别是包括看涨和看跌情感术语,用于进一步的情感分析。我们还为古巴语文本提出了一种自动更新模块和情感分类算法。实验表明,所提出的词典在情感分析方面比HowNet和NTUSD更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号