首页> 外文会议>International conference on Information and knowledge management >A method of geographical name extraction from Japanese text for thematic geographical search
【24h】

A method of geographical name extraction from Japanese text for thematic geographical search

机译:从日语文本中提取地理名称以进行主题地理搜索的方法

获取原文

摘要

A text retrieval method called the thematic geographical search method has been developed and applied to a Japanese encyclopedia called the World Encyclopaedia. In this method, the user specifies a search theme using free words, then obtains a sorted list of excerpts and hyperlinks to encyclopedia sentences that contain geographical names. Using this list, the user can also open maps that indicate the locations of the names. To generate an index of names for this searching, a method of extracting geographical names has been developed. In this method, geographical names are extracted, matched to names in a geographical name database, and identified. Geographical names, however, often have several types of ambiguities. Ambiguities are resolved by using non-local context analysis, which uses a stack and several other techniques. As a result, the precision of extracted names is more than 96% on average. This method depends on features of the Japanese language, but the strategy and most of the techniques can be applied to texts in English or other languages.

机译:

已开发出一种称为主题地理搜索方法的文本检索方法,并将其应用于日本的百科全书中,即“世界百科全书”。在这种方法中,用户使用自由词指定搜索主题,然后获得摘录和到包含地理名称的百科全书句子的超链接的排序列表。使用此列表,用户还可以打开指示名称位置的地图。为了产生用于该搜索的名称索引,已经开发了提取地理名称的方法。在这种方法中,提取地名,使其与地名数据库中的名称匹配并进行标识。但是,地名通常具有多种类型的歧义。通过使用非本地上下文分析解决歧义,该非本地上下文分析使用堆栈和其他几种技术。结果,提取的名称的平均精度平均超过96%。这种方法取决于日语的功能,但是该策略和大多数技术都可以应用于英语或其他语言的文本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号