首页> 外文期刊>Transportation Research Procedia >Traffic safety evaluation in Northwestern Federal District using sentiment analysis of Internet users’ reviews
【24h】

Traffic safety evaluation in Northwestern Federal District using sentiment analysis of Internet users’ reviews

机译:西北联邦地区交通安全评估使用互联网用户评论的情感分析

获取原文
           

摘要

The paper addresses the task of analyzing traffic safety in the Northwestern Federal District according to the reviews published in the Web. To accomplish the task, the authors developed a system of automatic review classification based on a sentiment classifier. They analyzed open source libraries for data mining, developed a web crawler using Scrapy framework, written in Python 3, and collected reviews. They also considered the methods of text vectorization and lemmatization and their application in the Scikit-Learn library: Bag-of-Words, N-gram, CountVectorizer, and TF-IDF Vectorizer. For the purpose of classification, the authors used the na?ve Bayes algorithm and a linear classifier model with stochastic gradient descent optimization. A base of tagged Twitter reviews was used as a training set. The classifier was trained using cross-validation and ShuffleSplit strategies. The authors also tested and compared the classification results for different classifiers. As a result of validation, the best model was determined. The developed system was applied to analyze the quality of roads in the Northwestern Federal District. Based on the outcome, the roads were marked-up in color to illustrate the results of the research.
机译:本文根据网络发布的评价,涉及分析西北联邦区交通安全的任务。为完成任务,作者开发了一种基于情感分类器的自动审查分类系统。他们分析了用于数据挖掘的开源库,使用Scrapy Framework开发了一个Web爬虫,编写在Python 3,并收集评论。他们还考虑了文本矢量化和lemmatization的方法,以及它们在Scikit-Learn库中的应用程序:文字袋,n-gram,countVectorizer和TF-IDF矢量化器。为了分类,作者使用了具有随机梯度下降优化的Na ve贝叶斯算法和线性分类器模型。标记的Twitter评论基础被用作培训集。分类器使用交叉验证和Shufflesplit策略进行培训。作者还测试了并比较了不同分类器的分类结果。由于验证,确定了最佳模型。开发系统应用于分析西北联邦区道路质量。基于结果,道路以颜色标记为以说明研究结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号