首页> 外文期刊>Frontiers in Pharmacology >Asymmetric author-topic model for knowledge discovering of big data in toxicogenomics
【24h】

Asymmetric author-topic model for knowledge discovering of big data in toxicogenomics

机译:毒理基因组学中大数据知识发现的不对称作者-主题模型

获取原文
           

摘要

The advancement of high-throughput screening technologies facilitates the generation of massive amount of biological data, a big data phenomena in biomedical science. Yet, researchers still heavily rely on keyword search and/or literature review to navigate the databases and analyses are often done in rather small-scale. As a result, the rich information of a database has not been fully utilized, particularly for the information embedded in the interactive nature between data points that are largely ignored and buried. For the past 10 years, probabilistic topic modeling has been recognized as an effective machine learning algorithm to annotate the hidden thematic structure of massive collection of documents. The analogy between text corpus and large-scale genomic data enables the application of text mining tools, like probabilistic topic models, to explore hidden patterns of genomic data and to the extension of altered biological functions. In this paper, we developed a generalized probabilistic topic model to analyze a toxicogenomics dataset that consists of a large number of gene expression data from the rat livers treated with drugs in multiple dose and time-points. We discovered the hidden patterns in gene expression associated with the effect of doses and time-points of treatment. Finally, we illustrated the ability of our model to identify the evidence of potential reduction of animal use.
机译:高通量筛选技术的发展促进了大量生物数据的生成,这是生物医学中的一种大数据现象。然而,研究人员仍然严重依赖关键字搜索和/或文献综述来浏览数据库,并且分析通常是在相当小的范围内进行的。结果,尚未充分利用数据库的丰富信息,特别是对于嵌入在很大程度上被忽略和掩埋的数据点之间的交互性质的信息。在过去的十年中,概率主题建模已被公认为是一种有效的机器学习算法,用于注释大量文档的隐藏主题结构。文本语料库与大规模基因组数据之间的类比使文本挖掘工具(如概率主题模型)的应用能够探索基因组数据的隐藏模式并扩展已改变的生物学功能。在本文中,我们开发了一种广义概率主题模型,用于分析毒理基因组学数据集,该数据集由来自在多个剂量和时间点用药物治疗的大鼠肝脏的大量基因表达数据组成。我们发现了与剂量和治疗时间点有关的基因表达中的隐藏模式。最后,我们说明了我们的模型识别潜在减少动物使用证据的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号