首页> 外文会议>European Conference on Principles and Practice of Knowledge Discovery in Databases >Rule Discovery and Probabilistic Modeling for Onomastic Data
【24h】

Rule Discovery and Probabilistic Modeling for Onomastic Data

机译:粗略数据的规则发现和概率建模

获取原文

摘要

The naming of natural features, such as hills, lakes, springs, meadows etc., provides a wealth of linguistic information; the study of the names and naming systems is called onomastics. We consider a data set containing all names and locations of about 58,000 lakes in Finland. Using computational techniques, we address two major onomastic themes. First, we address the existence of local dependencies or repulsion between occurrences of names. For this, we derive a simple form of spatial association rules. The results partially validate and partially contradict results obtained by traditional onomastic techniques. Second, we consider the existence of relatively homogeneous spatial regions with respect to the distributions of place names. Using mixture modeling, we conduct a global analysis of the data set. The clusterings of regions are spatially connected, and correspond quite well with the results obtained by other techniques; there are, however, interesting differences with previous hypotheses.
机译:自然特征的命名,如丘陵,湖泊,弹簧,草地等,提供了丰富的语言信息;名称和命名系统的研究称为孔子。我们考虑一个数据集,其中包含芬兰大约58,000湖的所有名称和位置。使用计算技术,我们解决了两个主要的杂色主题。首先,我们解决了名称出现之间的本地依赖关系或排斥。为此,我们推出了一种简单的空间关联规则形式。结果部分验证并部分地矛盾通过传统的综合技术获得的结果。其次,我们考虑关于地名分布的相对均匀的空间区域的存在。使用混合建模,我们对数据集进行全局分析。区域的集群在空间上连接,并且与其他技术获得的结果相当良好;然而,与以前的假设有有趣的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号