The Effectiveness of a Graph-Based Algorithm for Stemming

机译：基于图形的茎秆算法的有效性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In Information Retrieval (IR), stemming enables a matching of query and document terms which are related to a same meaning but which can appear in different morphological variants. In this paper we will propose and evaluate a statistical graph-based algorithm for stemming. Considering that a word is formed by a stem (prefix) and a derivation (suffix), the key idea is that strongly interlinked prefixes and suffixes form a community of sub-strings. Discovering these communities means searching for the best word splits which give the best word stems. We conducted some experiments on CLEF 2001 test sub-collections for Italian language. The results show that stemming improve the IR effectiveness. They also show that effectiveness level of our algorithm is comparable to that of an algorithm based on a-priori linguistic knowledge. This is an encouraging result, particularly in a multi-lingual context.

机译：在信息检索（IR）中，Stemming使得查询和文档术语的匹配与相同的含义相关但是可以出现在不同的形态变异中。在本文中，我们将提出并评估基于统计图的终测算法。考虑到词根（前缀）和衍生（后缀）形成一个单词，关键的想法是强烈互连的前缀和后缀形成了子字符串的社区。发现这些社区意味着寻找最好的词拆分，给出最好的单词茎。我们对意大利语的Clef 2001测试子集合进行了一些实验。结果表明，源病提高了红外效果。他们还表明，我们的算法的有效性水平与基于a-priori语言知识的算法的效力水平相当。这是一个令人鼓舞的结果，特别是在多语言背景下。

著录项

来源
《International conference on Asian digital libraries》|2002年||共12页
会议地点
作者
Michela Bacchin; Nicola Ferro; Massimo Melucci; Lecture Notes in Computer Science 2555;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电子图书馆、数字图书馆;
关键词

相似文献

外文文献
中文文献
专利

1. Graph-based algorithms and data-driven documents for formulation and visualization of large MDO systems [J] . Benedikt Aigner, Imco van Gent, Gianfranco La Rocca, CEAS Aeronautical Journal . 2018,第4期

机译：基于图形的算法和数据驱动的文档，用于大型MDO系统的公式化和可视化
2. A Graph-Based Taxonomy of Recommendation Algorithms and Systems in LBSNs [J] . Kefalas Pavlos, Symeonidis Panagiotis, Manolopoulos Yannis Knowledge and Data Engineering, IEEE Transactions on . 2016,第3期

机译：LBSN中基于图的推荐算法和系统分类法
3. Graph-based low complexity detection algorithms in multiple-input-multiple-out systems: an edge selection approach [J] . Lv, T., Long, Communications, IET . 2013,第12期

机译：多输入多输出系统中基于图的低复杂度检测算法：一种边缘选择方法
4. The Effectiveness of a Graph-Based Algorithm for Stemming [C] . Michela Bacchin, Nicola Ferro, Massimo Melucci, International conference on Asian digital libraries . 2002

机译：基于图形的茎秆算法的有效性
5. Fast Graph-Based Algorithms for Analyzing Protein-Protein Interaction Networks [D] . Shen, Yue. 2019

机译：基于快速的图形算法分析蛋白质 - 蛋白质相互作用网络
6. Algorithms for effective querying of compound graph-based pathway databases [O] . Ugur Dogrusoz, Ahmet Cetintas, Emek Demir, 2009

机译：基于复合图的路径数据库的有效查询算法
7. Graph-based Sequence Clustering through Multiobjective Evolutionary Algorithms for Web Recommender Systems [O] . Gul Nildem Demir, A. Sima Uyar, Sule Oguducu 2009

机译：基于图的序列聚类 - 基于多目标进化算法的Web推荐系统

The Effectiveness of a Graph-Based Algorithm for Stemming

摘要

著录项

相似文献

相关主题

期刊订阅