首页> 外文会议>AAAI Symposium >Scientific Data and Document Processing in Chem{sub}xSeer
【24h】

Scientific Data and Document Processing in Chem{sub}xSeer

机译:Chem {Sub} XSEER的科学数据和文献处理

获取原文

摘要

Chem{sub}xSeer is a digital library and a data repository for the chemistry domain. The data deposited into our repository is linked with digital documents to create aggregates of resources representing the links between the data and the articles in which the data is reported. Chem{sub}xSeer enables the user to annotate the data using a metadata capturing tool. The metadata is indexed and searched to return relevant datasets to the user. Chem{sub}xSeer extracts chemical formulae and chemical names, disambiguates them and indexes them to allow for domain-knowledge enhanced search capabilities. As search engines mature, we foresee such vertical search engines, employing domain-specific knowledge to perform information extraction and indexing, especially for scientific domains, become more popular. Though substantial research has been pursued on information extraction from text, extracting information from tables and figures has received little attention. In the Chem{sub}xSeer project, we are building tools that allow automatic extraction of tables and figures.
机译:Chem {Sub} Xseer是一种数字库和化学域的数据存储库。存放到我们的存储库中的数据与数字文档相关联,以创建表示数据和报告数据的文章之间的链接的资源的聚合。 Chem {Sub} XSEER使用户能够使用元数据捕获工具向数据注释。元数据被索引并搜索以将相关数据集返回给用户。 Chem {Sub} Xseer提取化学公式和化学名称,歧义它们并索引它们以允许域名知识增强的搜索功能。作为搜索引擎成熟,我们预见了这种垂直搜索引擎,采用特定于域的知识来执行信息提取和索引,特别是对于科学域,变得更加流行。虽然已在文本提取信息中进行了大量研究,但从表格和数字中提取信息收到了很少的关注。在Chem {Sub} XSEER项目中,我们正在构建工具,允许自动提取表格和图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号