Constructing Empirical Formulas for Testing Word Similarity by the Inductive Method of Model Self-Organization

机译：构建模型自组织归纳方法检测单词相似性的经验公式

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Identification of words with the same base meaning is a necessary procedure for many algorithms of computational linguistics and text processing. We propose to use for this a knowledge-poor approach using an empirical formula based on the number of the coincident letters in the initial parts of the two words and the number of non-coincident letters in the final parts of these two words. To construct such a formula for a given language, we use inductive method of self-organization developed by A. Ivahnenko. This method considers a set of models (formulas) of a given class and selects the best ones using training samples and test samples. We give a detailed example for English. We also show how to apply the formula for creating word frequency list.

机译：识别具有相同基本含义的单词是许多计算语言学和文本处理的许多算法的必要过程。我们建议使用基于两个单词的初始部分中的重合字母的数量和这两个单词的最终部分中的非重合字母数的重合字母的数量来使用经验公式的知识差的方法。为了为给定语言构建这种公式，我们使用由Avahnenko开发的自组织的归纳方法。该方法考虑了一组给定类的模型（公式），并使用培训样本和测试样本选择最佳的模型。我们提供详细的英语示例。我们还展示了如何应用用于创建字频率列表的公式。

著录项

来源
《International Conference on Portugal for Natural Language Processing》|2002年||共9页
会议地点
作者
Pavel Makagonov; Mikhail Alexandrov;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. Constructing and validating word similarity datasets by integrating methods from psychology, brain science and computational linguistics [J] . Yu Wan, Yidong Chen, Xiaodong Shi, Soft computing: A fusion of foundations, methodologies and applications . 2018,第21期

机译：通过将方法从心理学，脑科学和计算语言学集成来构建和验证单词相似度数据集
2. Self-organization in the commons: An empirically-tested model [J] . Ghorbani Amineh, Bravo Giangiacomo, Frey Ulrich, Environmental Modelling & Software . 2017,第octa期

机译：公地的自组织：一个经验检验的模型
3. New Methods for Processing Experimental Data: Applicability Tests for Empirical Formulas [J] . A. D. Polyanin, E. A. Vyazmina, V. V. Dilman Theoretical foundations of chemical engineering . 2008,第4期

机译：处理实验数据的新方法：经验公式的适用性测试
4. Constructing Empirical Formulas for Testing Word Similarity by the Inductive Method of Model Self-Organization [C] . Pavel Makagonov, Mikhail Alexandrov International Conference on Portugal for Natural Language Processing . 2002

机译：构建模型自组织归纳方法检测单词相似性的经验公式
5. Vocabulary learning through use of the picture-word inductive model for young English learners in China: A mixed methods examination using Cognitive Load Theory. [D] . Jiang, Xuan. 2014

机译：通过使用图词归纳模型对中国年轻英语学习者进行词汇学习：使用认知负荷理论的混合方法考试。
6. Three little words: an empirical test of the optimum scoring method for the RCP 3 questions [O] . David Price, Stan Musgrave, Amanda Lee, 2005

机译：三个小词：对RCP 3个问题的最佳评分方法的实证检验
7. Performance of Inductive Method of Model Self-Organization with Incomplete Model and Noisy Data* [O] . Natalia Ponomareva, Mikhail Alex, Er Gelbukh 2014

机译：不完全模型和噪声数据模型自组织归纳法的性能*
8. Empirical Formula Determination with an Inductively Coupled Plasma Gas Chromatographic Detector. [R] . Windsor, D. L., Denton, M. B. 1979

机译：用电感耦合等离子体气相色谱检测器确定经验公式。

Constructing Empirical Formulas for Testing Word Similarity by the Inductive Method of Model Self-Organization

摘要

著录项

相似文献

相关主题

期刊订阅