首页> 外文期刊>Annals of the New York Academy of Sciences >Creating Reference Datasets for Systems Biology Applications Using Text Mining
【24h】

Creating Reference Datasets for Systems Biology Applications Using Text Mining

机译:使用文本挖掘为系统生物学应用程序创建参考数据集

获取原文
获取原文并翻译 | 示例
       

摘要

High-throughput experimental techniques are generating large data collections with the aim of identifying novel entities involved in fundamental cellular processes as well as drawing a systematic picture of the relationships between individual components. Determining the accuracy of the resulting data and the selection of a subset of targets for more careful characterizations often requires relying on information provided by manually annotated data repositories. These repositories are incomplete and cover only a small fraction of the knowledge contained in the literature. We propose in this paper the use of text-mining technologies to extract, organize, and present information relevant for a particular biological topic. The aims of the resulting approach are (1) to enable topic-centric biological literature navigation, (2) to assist in the construction of manually revised data repositories, (3) to provide prioritization of biological entities for experimental studies, and (4) to enable human interpretation of large-scale experiments by providing direct links of bio-entities to relevant descriptions in the literature.
机译:高通量实验技术正在生成大量数据,其目的是识别参与基本细胞过程的新型实体,并绘制各个组件之间关系的系统图。确定结果数据的准确性和选择目标子集以进行更仔细的表征通常需要依靠手动注释的数据存储库提供的信息。这些资料库不完整,仅涵盖文献中所包含知识的一小部分。我们建议在本文中使用文本挖掘技术来提取,组织和呈现与特定生物学主题相关的信息。最终方法的目标是(1)启用以主题为中心的生物学文献导航;(2)协助构建手动修订的数据存储库;(3)为实验研究提供生物学实体的优先级;以及(4)通过提供生物实体与文献中相关描述的直接链接,使人类能够对大型实验进行解释。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号