首页> 外文会议>International Conference on Computer Science and Electronics Engineering >Text Emotion Classification Research Based on Improved Latent Semantic Analysis Algorithm
【24h】

Text Emotion Classification Research Based on Improved Latent Semantic Analysis Algorithm

机译:基于改进潜在语义分析算法的文本情感分类研究

获取原文

摘要

The emotion classification of text is an important research direction of text mining. Application on emotion text classification, latent semantic analysis algorithm has advantage of small occupied space, applicable to a large scale of text classifications. Compared with the traditional vector space model, latent semantic analysis algorithms reduce the search space for text classification by means of singular value decomposition for term and document matrix. Moreover, latent semantic analysis algorithms solve the problem of words with multiple meanings by analyzing the term at the semantic level. Using an improved latent semantic analysis algorithm to classify the test set by their emotion. The new cluster centroid is the average vector for each emotion category, and access to emotions classification for training dataset by calculating similarity of the average vector and test textual. The experimental results show that the improved latent semantic analysis algorithm have high precision and recall rate as same as the original algorithm, the efficiency of text emotion classification improved 4 percentage points.
机译:文本的情感分类是文本挖掘的重要研究方向。在情感文本分类中的应用,潜在语义分析算法具有小占用空间的优势,适用于大规模的文本分类。与传统的矢量空间模型相比,潜在语义分析算法通过单数值分解来减少文本分类的搜索空间,术语和文档矩阵。此外,通过在语义级别分析术语,解决了潜在语义分析算法解决了多种含义的问题。使用改进的潜在语义分析算法对他们的情感进行分类。新的集群中心是每个情感类别的平均矢量,通过计算平均矢量的相似性和测试文本的相似性来访问培训数据集的情绪分类。实验结果表明,改进的潜在语义分析算法具有高精度和召回率与原始算法相同,文本情绪分类的效率提高了4个百分点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号