首页> 外国专利> Stock forecast model based on text news by random forest

Stock forecast model based on text news by random forest

机译:基于随机森林文本消息的库存预测模型

摘要

#$%^&*AU2018101531A420181115.pdf#####ABSTRACT: The main purpose of this project is to use random forest (RF) algorithm to analyze the correlation between the stock news on the historical days and the ups and downs of the stocks the next day. We then find out the information hidden behind these data by sorting and screening stocks, that is, the key words or key events related to the rise and fall of the stocks to predict the rise and fall of the stocks in the near future. By using this method, after entering the news data of the stock market on that day, we can predict the rise and fall of one stock the next day. This relatively accurate method can help shareholders get rid of the risk of stock investment and even can guarantee stable investment returns. This model uses random forest algorithm to carry out excavating and classifying the information of text news. Random forest is a combinatorial classifier, which can be used for the classification and screening of the stocks. The essence of it is a set of tree classifiers. Among them, the base classifier H (x, beta k) is a classification decision tree constructed by CART algorithm without pruning. x is the input vector and beta k is an independent and identically distributed random vector which determine the growth process of the single tree (base classifier). The output is determined by a simple majority voting method.START READ ALL THE CONTENTS INPUT OF THE NEWS DATA DATA TRAIN THE CLEANING W2V MODEL READ THE CONTENTS ACCORDING TO THE DATES MATCH EACH WORD WITH A VECTOR TAKE T HE AVERAGE OF THE WORD VECTORS TEXT VECTORISATION INPUT(x) INPUT(y) RANDOM 1 FOREST MODEL Fig.1 General Flow-Chart
机译:#$%^&* AU2018101531A420181115.pdf #####抽象:该项目的主要目的是使用随机森林(RF)算法来分析历史上的股票新闻与股票的涨跌之间的相关性第二天的股票。然后我们通过以下方式找出隐藏在这些数据背后的信息分类和筛选股票,即与上升有关的关键词或关键事件股票的下跌和下跌,以预测不久的将来股票的上涨和下跌。通过使用这种方法,输入当天的股市新闻数据后,我们可以预测第二天一只股票的涨跌。这种相对准确的方法可以帮助股东摆脱股票投资的风险,甚至可以保证稳定投资收益。该模型采用随机森林算法进行挖掘对文本新闻信息进行分类。随机森林是一个组合分类器,可用于股票的分类和筛选。的它的本质是一组树分类器。其中,基本分类器H(x,beta k)为通过CART算法构造的分类决策树,无需修剪。 x是输入向量和beta k是一个独立且分布均匀的随机向量决定单棵树(基础分类器)的生长过程。输出是通过简单的多数表决方法确定。开始阅读所有内容输入新闻数据数据培训清洁W2V模型阅读内容根据日期相互匹配单词与矢量取平均值单词矢量文本可视化输入(x)输入(y)随机1森林模型图1一般流程图

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号