首页> 外文期刊>Journal of Systemics, Cybernetics and Informatics >Automatic Identification of Travel Locations in Rare Books - Object Oriented Information Management
【24h】

Automatic Identification of Travel Locations in Rare Books - Object Oriented Information Management

机译:自动识别稀有书籍中的旅行位置-面向对象的信息管理

获取原文
           

摘要

The digital content of the Internet is growing exponentially and mass digitization of printed media opens access to literature, in particular the genre of travel literature from the 18 th and 19 th century, which consists of diaries or travel books describing routes, observations or inspirations. The identification of described locations in the digital text is a long-standing challenge which requires information technology to supply dynamic links to sources by new forms of interaction and synthesis between humanistic texts and scientific observations. Using object oriented information technology, a prototype of a software tool is developed which makes it possible to automatically identify geographic locations and travel routes mentioned in rare books. The information objects contain properties such as names and classification codes for populated places, streams, mountains and regions. Together, with the latitudes and longitudes of every single location, it is possible to georeference this information in order that all processed and filtered datasets can be displayed by a map application. This method has already been used in the Humboldt Digital Library to present Alexander von Humboldt’s maps and was tested in a case study to prove the correctness and reliability of the automatic identification of locations based on the work of Alexander von Humboldt and Johann Wolfgang von Goethe. The results reveal numerous errors due to misspellings, change of location names, equality of terms and location names. But on the other hand it becomes very clear that results of the automatic object detection and recognition can be improved by error-free and comprehensive sources. As a result an increase in quality and usability of the service can be expected, accompanied by more options to detect unknown locations in the descriptions of rare books.
机译:互联网的数字内容呈指数增长,印刷媒体的大规模数字化开放了文学的获取渠道,尤其是18世纪和19世纪旅行文学的类型,其中包括描述路线,观察或灵感的日记或旅行书。数字文本中所描述位置的识别是一项长期的挑战,需要信息技术通过人文文本与科学观测之间的新型交互和综合形式,提供与资源的动态链接。使用面向对象的信息技术,开发了软件工具的原型,该原型使自动识别稀有书籍中提到的地理位置和行进路线成为可能。信息对象包含属性,例如人口场所,溪流,山脉和地区的名称和分类代码。连同每个位置的纬度和经度,可以地理参考此信息,以便地图应用程序可以显示所有经过处理和过滤的数据集。该方法已在洪堡数字图书馆中用于显示亚历山大·冯·洪堡的地图,并在案例研究中进行了测试,以证明基于亚历山大·冯·洪堡和约翰·沃尔夫冈·冯·歌德的工作而自动识别位置的正确性和可靠性。结果显示,由于拼写错误,位置名称更改,术语和位置名称相等而导致的许多错误。但是另一方面,很明显,可以通过无错误且全面的信息源来改善自动目标检测和识别的结果。结果,可以期望提高服务质量和可用性,并伴随着更多的选择来检测稀有书籍描述中的未知位置。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号