Selecting Labels for News Document Clusters

机译：选择新闻文档集群的标签

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This work deals with determination of meaningful and terse cluster labels for News document clusters. We analyze a number of alternatives for selecting headlines and/or sentences of document in a document cluster (obtained as a result of an entity-event-duration query), and formalize an approach to extracting a short phrase from well-supported headlines/sentences of the cluster that can serve as the cluster label. Our technique maps a sentence into a set of significant stems to approximate its semantics, for comparison. Eventually a cluster label is extracted from a selected headline/sentence as a contiguous sequence of words, resuscitating word sequencing information lost in the formalization of semantic equivalence.

机译：这项工作涉及新闻文件集群的有意义和集群标签的确定。我们分析了许多用于在文档群集中选择文档的标题和/或句子的许多替代方案（由实体 - 事件持续时间查询获得），并将一种从受支持的头条/句子中提取短语的方法可以作为群集标签的群集。我们的技术将一个句子映射到一组重要的茎中以近似其语义，以进行比较。最终，从选定的标题/句子中提取群集标签作为连续的单词序列，重新刺除在语义等效的形式化中丢失的单词排序信息。

著录项

来源
《International Conference on Applications of Natural Language to Information Systems》|2007年||共12页
会议地点
作者
Krishnaprasad Thirunarayan; Trivikram Immaneni; Mastan Vali Shaik;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Improving hierarchical document cluster labels through candidate term selection [J] . Fabiano Fernandes dos Santos, Veronica Oliveira de Carvalho, Solange Oliveira Rezende Intelligent decision technologies . 2012,第1期

机译：通过候选词选择改善层次结构文档簇标签
2. Self-Organizing Map vs Initial Centroid Selection Optimization to Enhance K-Means with Genetic Algorithm to Cluster Transcribed Broadcast News Documents [J] . Maghawry Ahmed, Omar Yasser M. K., Badr Amr The international arab journal of information technology . 2020,第3期

机译：自组织地图与初始心针选择优化，以增强K-mean，以遗传算法为群集转录的广播新闻文档
3. Self-Organizing Map vs Initial Centroid Selection Optimization to Enhance K-Means with Genetic Algorithm to Cluster Transcribed Broadcast News Documents [J] . Current Organic Synthesis . 2020,第3期

机译：自组织地图VS初始心针选择优化，以增强K-means，以遗传算法为群集转录的广播新闻文档
4. Selecting Labels for News Document Clusters [C] . Krishnaprasad Thirunarayan, Trivikram Immaneni, Mastan Vali Shaik International Conference on Applications of Natural Language to Information Systems(NLDB 2007); 20070627-29; Paris(FR) . 2007

机译：为新闻文档集群选择标签
5. Text document topical recursive clustering and automatic labeling of a hierarchy of document clusters. [D] . Li, Xiaoxiao. 2012

机译：文本文档主题递归群集和文档群集层次结构的自动标记。
6. Pharmaceutical Industry Off-label Promotion and Self-regulation: A Document Analysis of Off-label Promotion Rulings by the United Kingdom Prescription Medicines Code of Practice Authority 2003–2012 [O] . Andreas Vilhelmsson, Courtney Davis, Shai Mulinari 2016

机译：制药业标签外促销和自我监管：英国处方药业务守则权威机构2003-2012年标签外促销规定的文件分析
7. Selecting Labels for News Document Clusters [O] . Thirunarayan, Krishnaprasad, Immaneni, Trivikram, Shaik, Mastan 2014

机译：为新闻文档集群选择标签

Selecting Labels for News Document Clusters

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅