首页> 外文期刊>Telematics and Informatics >Lexicon-based approach outperforms Supervised Machine Learning approach for Urdu Sentiment Analysis in multiple domains
【24h】

Lexicon-based approach outperforms Supervised Machine Learning approach for Urdu Sentiment Analysis in multiple domains

机译:在多个领域中,基于词典的方法胜过有监督的机器学习方法进行乌尔都语情感分析

获取原文
获取原文并翻译 | 示例
           

摘要

Web is facilitating people to express their views and opinions on different topics through reviews and blogs. Effective advantages can be reaped from these reviews and blogs by fusing the sentiment knowledge. In this research, Sentiment Analysis of Urdu blogs from multiple domains is done by using the two widely used approaches i.e. the Lexicon-based approach and the Supervised Machine Learning approach. Three well known classifiers i.e. Support Vector Machine, Decision Tree and K Nearest Neighbor are used in case of Supervised Machine Learning approach whereas a wide coverage Urdu Sentiment Lexicon and an efficient Urdu Sentiment Analyzer are used in Lexicon-based approach. In both the approaches the information are fused from two sources to successfully perform Sentiment Analysis. In case of Lexicon-based approach, the two sources are the wide coverage Urdu Sentiment Lexicon and the efficient Urdu Sentiment Analyzer. In case of Supervised Machine Learning approach, the two sources are the un-annotated data and annotated data along with important attributes. After performing Sentiment Analysis using both the approaches, the results are observed carefully and on the basis of experiments performed in this research, it is concluded that the Lexicon-based approach outperforms Supervised Machine Learning approach not only in terms of Accuracy, Precision, Recall and F-measure but also in terms of economy of time and efforts used.
机译:网络正在促进人们通过评论和博客表达对不同主题的观点和看法。通过融合情感知识,可以从这些评论和博客中获得有效的优势。在这项研究中,使用两种广泛使用的方法,即基于词典的方法和有监督的机器学习方法,对来自多个领域的乌尔都语博客进行了情感分析。在监督机器学习方法的情况下,使用了三个众所周知的分类器,即支持向量机,决策树和K最近邻,而在基于词典的方法中使用了覆盖面广的Urdu情感词典和高效的Urdu情感分析器。在这两种方法中,信息都是从两个来源融合而来,以成功执行情感分析。在基于Lexicon的方法的情况下,两个来源是覆盖面广的Urdu情感词典和高效的Urdu情感分析器。在监督机器学习方法的情况下,两个来源是未注释的数据和带有重要属性的已注释数据。使用这两种方法进行情感分析后,将仔细观察结果,并根据本研究中进行的实验得出结论:基于词典的方法不仅在准确性,精度,召回率和F度量,而且还可以节省时间和精力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号