首页> 外文会议>Aisa-Pacific web and web-age information management joint conference on web and big data >DeepAM: Deep Semantic Address Representation for Address Matching
【24h】

DeepAM: Deep Semantic Address Representation for Address Matching

机译:DeepAM:用于地址匹配的深度语义地址表示

获取原文

摘要

Address matching is a crucial task in various location-based businesses like take-out services and express delivery, which aims at identifying addresses referring to the same location in address databases. It is a challenging one due to various possible ways to express the address of a location, especially in Chinese. Traditional address matching approaches relying on string similarities and learning matching rules to identify addresses referring to the same location, could hardly solve the cases with redundant, incomplete or unusual expression of addresses. In this paper, we propose to map every address into a fixed-size vector in the same vector space using state-of-the-art deep sentence representation techniques and then measure the semantic similarity between addresses in this vector space. The attention mechanism is also applied to the model to highlight important features of addresses in their semantic representations. Last but not least, we novelly propose to get rich contexts for addresses from the web through web search engines, which could strongly enrich the semantic meaning of addresses that could be learned. Our empirical study conducted on two real-world address datasets demonstrates that our approach greatly improves both precision (up to 5%) and recall (up to 8%) of the state-of-the-art existing methods.
机译:在各种基于位置的业务(如外卖服务和快递)中,地址匹配是一项至关重要的任务,其目的是在地址数据库中标识引用同一位置的地址。由于存在各种可能的方式来表达地点的地址,这尤其具有挑战性,尤其是中文。传统的地址匹配方法依靠字符串相似性和学习匹配规则来识别引用相同位置的地址,很难解决地址冗余,不完整或异常表达的情况。在本文中,我们建议使用最新的深层句子表示技术将每个地址映射到相同向量空间中的固定大小的向量中,然后测量该向量空间中地址之间的语义相似性。注意机制还应用于模型,以在其语义表示中突出显示地址的重要特征。最后但并非最不重要的一点是,我们新颖地建议通过Web搜索引擎从Web获得丰富的地址地址上下文,这可以极大地丰富可以学习的地址的语义。我们在两个真实世界的地址数据集上进行的实证研究表明,我们的方法极大地提高了现有技术的准确性(最高5%)和查全率(最高8%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号