Dimensionality Reduction for Text using Domain Knowledge


Abstract

Text documents are complex high dimensional objects. To effectively visualize such data it is important to reduce its dimensionality and visualize the low dimensional embedding as a 2-D or 3-D scatter plot. In this paper we explore dimensionality reduction methods that draw upon domain knowledge in order to achieve a better low dimensional embedding and visualization of documents. We consider the use of geometries specified manually by an expert, geometries derived automatically from corpus statistics, and geometries computed from linguistic resources.
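
As a rough illustration of the idea in the abstract, the sketch below shows one way a manually specified word geometry could enter a 2-D document embedding. It is a minimal sketch under stated assumptions, not the authors' implementation: the toy vocabulary, the similarity matrix `H`, the factorization `H = T^T T`, and the helper `embed_with_word_geometry` are all hypothetical choices made for this example. Documents are term-count vectors, distances are measured after applying the linear map `T`, and the result is projected to 2-D with PCA (which, for Euclidean distances, gives the same configuration as classical multidimensional scaling).

```python
import numpy as np

def embed_with_word_geometry(X, T, n_components=2):
    """Hypothetical helper: PCA of documents after a word-level linear map.

    X: (n_docs, n_words) term-count matrix.
    T: (k, n_words) linear map; Euclidean distance between T x and T y
       equals sqrt((x - y)^T H (x - y)) with H = T^T T.
    """
    Z = X @ T.T                       # documents in the transformed space
    Z = Z - Z.mean(axis=0)            # center before PCA
    U, S, _ = np.linalg.svd(Z, full_matrices=False)
    return U[:, :n_components] * S[:n_components]

# Assumed toy vocabulary: two sports words, two politics words.
vocab = ["goal", "match", "election", "vote"]

# Assumed expert-specified similarities between words (symmetric,
# positive definite): words within a topic are treated as close.
H = np.array([
    [1.0, 0.8, 0.0, 0.0],
    [0.8, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.8],
    [0.0, 0.0, 0.8, 1.0],
])

# np.linalg.cholesky returns lower-triangular L with H = L L^T,
# so T = L^T satisfies H = T^T T.
T = np.linalg.cholesky(H).T

# Four toy documents, each using a single distinct word.
X = np.array([
    [2.0, 0.0, 0.0, 0.0],
    [0.0, 2.0, 0.0, 0.0],
    [0.0, 0.0, 2.0, 0.0],
    [0.0, 0.0, 0.0, 2.0],
])

Y = embed_with_word_geometry(X, T)

same_topic = np.linalg.norm(Y[0] - Y[1])    # "goal" doc vs "match" doc
cross_topic = np.linalg.norm(Y[0] - Y[2])   # "goal" doc vs "election" doc
print(same_topic, cross_topic)
```

Under the plain Euclidean geometry these four documents are all equally far apart because they share no words. With the assumed `H`, the two sports documents land closer to each other than to the politics documents, which is the kind of effect a domain-informed geometry is meant to produce.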


Bibliographic details

Source
International Conference on Computational Linguistics | 2010 | 9 pages
Authors
Yi Mao; Krishnakumar Balasubramanian; Guy Lebanon
Original format: PDF
Chinese Library Classification: program design, software engineering

