首页> 外文会议>Workshop on multiword expressions: from theory to application. >Dimensionality Reduction for Text using Domain Knowledge
【24h】

Dimensionality Reduction for Text using Domain Knowledge

机译:使用领域知识的文本降维

获取原文
获取原文并翻译 | 示例

摘要

Text documents are complex high dimen-sional objects. To effectively visualize such data it is important to reduce its di-mensionality and visualize the low dimen-sional embedding as a 2-D or 3-D scatter plot. In this paper we explore dimension-ality reduction methods that draw upon domain knowledge in order to achieve a better low dimensional embedding and vi-sualization of documents. We consider the use of geometries specified manually by an expert, geometries derived automat-ically from corpus statistics, and geome-tries computed from linguistic resources.
机译:文本文档是复杂的高维对象。为了有效地可视化此类数据,重要的是减小其尺寸并将低维嵌入可视化为2D或3D散点图。在本文中,我们探索了基于领域知识的降维方法,以实现更好的低维文档嵌入和可视化。我们考虑使用由专家手动指定的几何图形,从语料统计中自动得出的几何图形以及从语言资源中计算出的几何图形。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号