首页> 外文会议>International Conference of Artificial Intelligence and Information Technology >Word Embedding Comparison for Indonesian Language Sentiment Analysis
【24h】

Word Embedding Comparison for Indonesian Language Sentiment Analysis

机译:词嵌入比较在印尼语言情感分析中的应用

获取原文

摘要

Development of information technology makes the production of data increase dramatically. We can get lots of data from the internet, including data reviews about a product or service. The more data obtained, the system is needed to process it. Sentiment analysis is a text processing of Natural Language Processing (NLP) that can help someone to see the quality of service offered, including hotel services. This paper uses hotel review data to carry out sentiment analysis obtained from the Traveloka website. The data classified using the Long Short-Term Memory (LSTM) algorithm. To get better results, the authors use word embedding to convert words into vectors. This study aims to compare the performance of several word embedding, while word embedding compared is word2vec Continuous Bag of Words CBOW, word2vec skip-gram, doc2vec, and glove. From the experiment conducted, the results show that the glove method has the highest accuracy of 95.52% while the word2vec skip-gram model has the lowest accuracy of 91.81%, so it concluded that the glove method is the best word embedding method for hotel review data.
机译:信息技术的发展使数据的产生急剧增加。我们可以从互联网上获取大量数据,包括有关产品或服务的数据评论。获得的数据越多,需要系统对其进行处理。情感分析是自然语言处理(NLP)的文本处理,可以帮助某人查看所提供服务的质量,包括酒店服务。本文使用酒店评论数据来进行从Traveloka网站获得的情感分析。使用长短期记忆(LSTM)算法对数据进行分类。为了获得更好的结果,作者使用单词嵌入将单词转换为向量。本研究旨在比较几种单词嵌入的性能,而比较的单词嵌入是word2vec单词连续包CBOW,word2vec跳过语法,doc2vec和手套。实验结果表明,手套法的准确率最高,为95.52%,而word2vec跳过图模型的准确度最低,为91.81%,因此得出结论,手套法是酒店评价的最佳词嵌入方法。数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号