首页> 外文会议>International MultiConference of Engineers and Computer Scientists >Extraction of Plant Identification Keys Using Approximate String Matching for Species Properties Classification
【24h】

Extraction of Plant Identification Keys Using Approximate String Matching for Species Properties Classification

机译:用近似弦匹配对物种属性分类的近似弦匹配提取

获取原文

摘要

Most biologists keep data in separate databases. These databases are not necessary well-structured. Plant identification keys are among such data. They are data-rich description containing plant identification terminologies and maybe used to identify various plant species. The way the data is kept often requires the species identification to be done using rules that are applied sequentially. Done manually, this is very time consuming. Information extraction (IE) is a process of selecting information such as names, terms, or phrases, from a natural language text documents. This information is then structured into a specified template for retrieval. This method is applied to plant identification keys kept by the biologists. Before the keys are extracted from the description, they have to go through a number of processes. In this paper, we illustrate the pre-processing and processing methods with an example from a database, with emphasis on the approximate string matching algorithm to extract the most relevant keys from the description.
机译:大多数生物学家将数据保留在单独的数据库中。这些数据库不是必需结构良好的。植物识别键是这样的数据。它们是具有富含植物识别术语的数据丰富的描述,也许用于识别各种植物物种。保留数据的方式通常需要使用顺序应用的规则来完成物种识别。手动完成,这是非常耗时的。信息提取(IE)是从自然语言文本文档中选择诸如名称,术语或短语的信息的过程。然后将该信息构造成指定的用于检索模板。该方法应用于生物学家保存的工厂识别键。在从描述中提取键之前,它们必须经过许多进程。在本文中,我们示出了具有来自数据库的示例的预处理和处理方法,重点是近似串匹配算法从描述中提取最相关的密钥。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号