Generating Different Semantic Spaces for Document Classification


Abstract

Document classification is an important technique for digital libraries, WWW pages, and similar collections. Because of synonymy and polysemy, it is better to classify documents based on latent semantics. A local semantic basis, which contains the features of documents within a particular category, has more discriminative power and is more effective for classification than a global semantic basis, which contains the features common to all available documents. Because the semantic basis obtained by nonnegative matrix factorization (NMF) corresponds directly to samples, while the basis obtained by singular value decomposition (SVD) does not, NMF is well suited to obtaining a local semantic basis. This paper compares global and local semantic bases obtained by SVD and NMF. The experimental results show that the local semantic basis obtained by NMF achieves the best classification accuracy.
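The idea of a local semantic basis can be illustrated with a minimal sketch. It assumes NumPy, a term-by-document count matrix, and the standard multiplicative-update form of NMF. The function names (nmf, encode, fit_local_bases, classify), the toy data, and the decision rule (assign a document to the category whose basis reconstructs it with the smallest error) are assumptions made for illustration; the abstract does not state how the paper performs the classification step.

```python
import numpy as np

def nmf(V, rank, iters=200, seed=0, eps=1e-9):
    """Approximate V (terms x docs) as W @ H with multiplicative updates."""
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + eps
    H = rng.random((rank, V.shape[1])) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H

def encode(W, v, iters=200, eps=1e-9):
    """Nonnegative coefficients h such that W @ h approximates v (W fixed)."""
    h = np.ones(W.shape[1])
    for _ in range(iters):
        h *= (W.T @ v) / (W.T @ W @ h + eps)
    return h

def fit_local_bases(V, labels, rank):
    """One NMF basis per category, fitted on that category's documents only."""
    labels = np.asarray(labels)
    bases = {}
    for c in sorted(set(labels.tolist())):
        bases[c] = nmf(V[:, labels == c], rank)[0]
    return bases

def classify(bases, v):
    """Assumed rule: pick the category with the smallest reconstruction error."""
    errors = {c: np.linalg.norm(v - W @ encode(W, v)) for c, W in bases.items()}
    return min(errors, key=errors.get)

# Toy corpus (invented): rows are 6 terms, columns are 6 documents.
# Category "a" uses terms 0-2 only, category "b" uses terms 3-5 only.
V = np.array([
    [2, 1, 0, 0, 0, 0],
    [1, 2, 1, 0, 0, 0],
    [0, 1, 2, 0, 0, 0],
    [0, 0, 0, 2, 1, 0],
    [0, 0, 0, 1, 2, 1],
    [0, 0, 0, 0, 1, 2],
], dtype=float)
labels = ["a", "a", "a", "b", "b", "b"]

bases = fit_local_bases(V, labels, rank=2)
pred_a = classify(bases, np.array([1, 1, 1, 0, 0, 0], dtype=float))
pred_b = classify(bases, np.array([0, 0, 0, 1, 0, 1], dtype=float))
print(pred_a, pred_b)
```

Because each basis is fitted on one category's documents and stays nonnegative, its columns carry weight only on terms that category actually uses, which is the direct correspondence with samples that the abstract attributes to NMF. A global basis would instead be fitted once on all columns of V.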


Bibliographic record

Source: Advanced Workshop on Content Computing, 2004, 7 pages
Conference location: ZhenJiang (CN)
Authors: Jianjiang Lu; Baowen Xu; Jixiang Jiang
Original format: PDF
CLC classification: Computing technology, computer technology

Similar literature

1. Chinese semantic document classification based on strategies of semantic similarity computation and correlation analysis [J]. Yang Shuo, Wei Ran, Guo Jingzhi. Journal of Web Semantics, 2020, Issue Auga
2. Semantic Document Classification based on Strategies of Semantic Similarity Computation and Correlation Analysis [J]. Shuo Yang, Ran Wei, Hengliang Tan. Computer Science & Information Technology, 2019, Issue 13
3. On some recent progress in the classification of (P and Q)-polynomial association schemes [J]. Alexander L. Gavrilyuk, Jack H. Koolen. Arabian Journal of Mathematics, 2021, Issue 1
4. Generating Different Semantic Spaces for Document Classification [C]. Jianjiang Lu, Baowen Xu, Jixiang Jiang. Advanced Workshop on Content Computing (AWCC 2004); 2004-11-15 to 2004-11-17; ZhenJiang (CN), 2004
5. Computer-aided Semantic Signature Identification and Document Classification via Semantic Signatures [D]. Para, Uday Kiran. 2010
6. MOWDOC: A Dataset of Documents From Taking the Measure of Work for Building a Latent Semantic Analysis Space [O]. Kim F. Nimon. 2020
7. Development of a computer system for generating semantic template of a group of documents by using latent semantic analysis [O]. Yuriy Taranenko, Maryna Kabanova. 2016
