Clustering in a News Corpus

Abstract

We adapt the Suffix Tree Clustering method for application within a corpus of Norwegian news articles. Specifically, suffixes are replaced with n-grams and we propose a new measure for cluster similarity as well as a scoring-function for base-clusters. These modifications lead to substantial improvements in effectiveness and efficiency compared to the original algorithm.
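
As a rough illustration of the idea of replacing suffixes with n-grams, the following minimal Python sketch builds base clusters by grouping documents that share a word n-gram. It is not the paper's implementation: the abstract does not specify the new cluster-similarity measure or the base-cluster scoring function, so the merge test shown here is the overlap criterion commonly associated with the original Suffix Tree Clustering algorithm, included only for comparison. The function names (`word_ngrams`, `base_clusters`, `overlap_similar`), the parameters `n_max`, `min_docs`, and `threshold`, and the whitespace tokenisation are all assumptions made for this example.

```python
from collections import defaultdict


def word_ngrams(tokens, n_max=3):
    """Yield every word n-gram of length 1..n_max as a tuple."""
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            yield tuple(tokens[i:i + n])


def base_clusters(docs, n_max=3, min_docs=2):
    """Map each n-gram to the set of document ids that contain it.

    Only n-grams shared by at least `min_docs` documents are kept;
    each surviving (n-gram, document set) pair is one base cluster.
    Tokenisation is a plain lowercase whitespace split (an assumption).
    """
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for gram in word_ngrams(text.lower().split(), n_max):
            index[gram].add(doc_id)
    return {gram: ids for gram, ids in index.items() if len(ids) >= min_docs}


def overlap_similar(a, b, threshold=0.5):
    """Overlap test from the original STC algorithm (not the paper's new measure).

    Two base clusters count as similar when the shared documents make up
    more than `threshold` of each cluster's document set.
    """
    shared = len(a & b)
    return shared / len(a) > threshold and shared / len(b) > threshold


docs = [
    "oil prices rise again",
    "oil prices fall sharply",
    "election results announced",
]
clusters = base_clusters(docs)
```

With these three toy headlines, the n-gram `("oil", "prices")` forms a base cluster containing the first two documents, while n-grams that occur in only one document are discarded.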

Bibliographic details

Source: International Conference on Text, Speech and Dialogue, 2014, 7 pages
Author: Richard Elling Moe
Original format: PDF
Chinese Library Classification: TP391.1-53

Similar documents


1. Analysing headlines as a way of downsizing news corpora: Evidence from an Arabic-English comparable corpus of newspaper articles [J]. Haider Ahmad S., Hussein Riyad F. Literary & Linguistic Computing. 2020, No. 4
2. Newsgroup Topic Extraction using Probabilistic Inverse Cluster Frequency Term-Cluster Weighting and Growing Neural Gas Clustering [J]. Sigit Adinugroho, Muh Arif Rahman, Dahnial Syauqy. IAENG International Journal of Computer Science. 2021, No. 1Pta1
3. Stance markers in English medical research articles and newspaper opinion columns: A comparative corpus-based study [J]. Qian Shen, Yating Tao. PLoS One. 2021, No. 3
4. Clustering Sinhala News Articles Using Corpus-Based Similarity Measures [C]. Purnima Nanayakkara, Surangika Ranathunga. 4th International Moratuwa Engineering Research Conference. 2018
5. 'Pauper Aliens' and 'Political Refugees': A Corpus Linguistic Approach to the Language of Migration in Nineteenth-Century Newspapers [D]. Byrne, Ruth. 2020
6. The SFU Opinion and Comments Corpus: A Corpus for the Analysis of Online News Comments [O]. Varada Kolhatkar, Hanhan Wu, Luca Cavasso, 2019
