首页> 外文会议>European Conference on Artificial Intelligence >A linguistic and machematical method for mapping thematic trends from texts
【24h】

A linguistic and machematical method for mapping thematic trends from texts

机译:一种从文本映射专题趋势的语言与对规方法

获取原文

摘要

We present a novel method for mapping thematic trends called Classification by Preferential Clustered link" (CPCL). This method clusters relevant textual units (terms) from a corpus of texts, based on meaningful linguistic relations (syntacticvariations) identified amongst the units. Terms related through syntactic variations are represented in the form of a graph and are first clustered into connected components using the subset of variation relations affecting the modifier word(s) in a term. The connected components are in turn clustered into classes using the subset of variation relations affecting the head word in a term. Through a chronological analysis of the terms, the method pinpoints the evolution of research topics. The CPCL methoddiffers from classical data analysis methods in that it integrates n meaningful linguistic relations as classification criteria. Also, the method avoids the bias caused by fixing class size before classification and thus splitting classes artificiallyduring clustering. The graph formalism, the theoretical model underlying the CPCL method offers a powerful means of representing the linguistic relations between terms.
机译:我们提出映射专题趋势的新方法被称为分类的优惠集群链接”(CPCL)。这种方法集群相关的文本单元(项)从文本的语料库,基于有意义的语言关系确定的单位之间(syntacticvariations)。相关条款通过句法变化以图表的形式表示,并使用影响的项的改性剂字(一个或多个)的变化关系的子集被第一群集到连接部件。该连接部件被依次聚集成使用变化关系的所述子集的类影响一个长期的头一句话。通过条款的时间分析,该方法精确定位的研究课题的演变。从它集成了ň有意义的语言关系作为分类标准的古典数据分析方法的CPCL methoddiffers。此外,该方法可避免之前分类和由此分离类人工造成定影类大小的偏置lyduring集群。该图形式主义,理论模型基础的方法CPCL提供代表术语之间的关系语言的有力手段。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号