【24h】

Rule Discovery and Probabilistic Modeling for Onomastic Data

机译:异常数据的规则发现和概率建模

获取原文
获取原文并翻译 | 示例

摘要

The naming of natural features, such as hills, lakes, springs, meadows etc., provides a wealth of linguistic information; the study of the names and naming systems is called onomastics. We consider a data set containing all names and locations of about 58,000 lakes in Finland. Using computational techniques, we address two major onomastic themes. First, we address the existence of local dependencies or repulsion between occurrences of names. For this, we derive a simple form of spatial association rules. The results partially validate and partially contradict results obtained by traditional onomastic techniques. Second, we consider the existence of relatively homogeneous spatial regions with respect to the distributions of place names. Using mixture modeling, we conduct a global analysis of the data set. The clusterings of regions are spatially connected, and correspond quite well with the results obtained by other techniques; there are, however, interesting differences with previous hypotheses.
机译:自然特征的命名,例如山丘,湖泊,泉水,草地等,提供了丰富的语言信息;对名称和命名系统的研究称为本体论。我们考虑一个数据集,其中包含芬兰约58,000个湖泊的所有名称和位置。使用计算技术,我们解决了两个主要的本体主题。首先,我们解决名称出现之间的局部依赖或排斥。为此,我们导出了一种简单形式的空间关联规则。结果部分验证并部分矛盾了通过传统本体技术获得的结果。其次,我们考虑到相对于地名分布而言相对均一的空间区域。使用混合建模,我们对数据集进行了全局分析。区域的聚类在空间上相关联,并且与其他技术获得的结果非常吻合;但是,与先前的假设有一些有趣的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号