MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment

Karthikeyan Muthukumarasamy; Pandit Yogesh; Pandit Deepak; Vyas Renu

首页> 外文期刊>Combinatorial chemistry & high throughput screening >MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment

【24h】

MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment

机译：MegaMiner：使用化学信息学工具和云计算环境通过文本挖掘进行铅识别的工具

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Virtual screening is an indispensable tool to cope with the massive amount of data being tossed by the high throughput omics technologies. With the objective of enhancing the automation capability of virtual screening process a robust portal termed MegaMiner has been built using the cloud computing platform wherein the user submits a text query and directly accesses the proposed lead molecules along with their drug-like, lead-like and docking scores. Textual chemical structural data representation is fraught with ambiguity in the absence of a global identifier. We have used a combination of statistical models, chemical dictionary and regular expression for building a disease specific dictionary. To demonstrate the effectiveness of this approach, a case study on malaria has been carried out in the present work. MegaMiner offered superior results compared to other text mining search engines, as established by F score analysis. A single query term 'malaria' in the portlet led to retrieval of related PubMed records, protein classes, drug classes and 8000 scaffolds which were internally processed and filtered to suggest new molecules as potential anti-malarials. The results obtained were validated by docking the virtual molecules into relevant protein targets. It is hoped that MegaMiner will serve as an indispensable tool for not only identifying hidden relationships between various biological and chemical entities but also for building better corpus and ontologies.

机译：虚拟筛选是应对高通量组学技术抛弃大量数据的必不可少的工具。为了增强虚拟筛选过程的自动化能力，已使用云计算平台构建了一个名为MegaMiner的强大门户网站，其中用户提交文本查询并直接访问建议的铅分子以及它们的类药物，类铅和对接分数。在没有全局标识符的情况下，文本化学结构数据表示充满歧义。我们使用了统计模型，化学词典和正则表达式的组合来构建特定疾病的词典。为了证明这种方法的有效性，目前在疟疾方面进行了案例研究。与通过F得分分析确定的其他文本挖掘搜索引擎相比，MegaMiner提供了更好的结果。 Portlet中的单个查询词“疟疾”导致检索了相关的PubMed记录，蛋白质类别，药物类别和8000个支架，这些支架在内部进行了处理和过滤，以暗示新的分子可能作为抗疟疾药物。通过将虚拟分子对接至相关的蛋白质靶标来验证获得的结果。希望MegaMiner将成为必不可少的工具，不仅可以识别各种生物和化学实体之间的隐藏关系，而且可以构建更好的语料库和本体。

著录项

来源
《Combinatorial chemistry & high throughput screening》 |2015年第6期|共13页
作者
Karthikeyan Muthukumarasamy; Pandit Yogesh; Pandit Deepak; Vyas Renu;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类化学;
关键词
Chemoinformatics; cloud computing; malaria; text mining; virtual screening;

机译：化学信息学;云计算;疟疾;文本挖掘;虚拟筛选;

相似文献

外文文献
中文文献
专利

1. MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment [J] . Karthikeyan Muthukumarasamy, Pandit Yogesh, Pandit Deepak, Combinatorial chemistry & high throughput screening . 2015,第6期

机译：MegaMiner：使用化学信息学工具和云计算环境通过文本挖掘进行铅识别的工具
2. METSP: A Maximum-Entropy Classifier Based Text Mining Tool for Transporter-Substrate Identification with Semistructured Text [J] . MinZhao, YanmingChen, DachengQu, BioMed research international . 2015,第1期

机译：METSP：一种基于最大熵分类器的文本挖掘工具，用于半结构化文本的转运基质识别
3. TMT-HCC: A tool for text mining the biomedical literature for hepatocellular carcinoma (HCC) biomarkers identification [J] . SeoudR.A.A., MabroukM.S. Computer Methods and Programs in Biomedicine: An International Journal Devoted to the Development, Implementation and Exchange of Computing Methodology and Software Systems in Biomedical Research and Medical Practice . 2013,第3期

机译：TMT-HCC：用于文本挖掘生物医学文献以鉴定肝细胞癌（HCC）生物标志物的工具
4. High-Performance Mining of COVID-19 Open Research Datasets for Text Classification and Insights in Cloud Computing Environments [C] . Jie Zhao, Maria A. Rodriguez, Rajkumar Buyya IEEE/ACM International Conference on Utility and Cloud Computing . 2020

机译：Covid-19开放研究数据集的高性能挖掘，用于云计算环境中的文本分类和见解
5. An Anonymous and Distributed Approach to Improving Privacy in Cloud Computing: An Analysis of Privacy-Preserving Tools & Applications [D] . Peters, Emmanuel Sean. 2017

机译：匿名和分布式方法来改善云计算中的隐私：隐私保护工具和应用程序的分析
6. METSP: A Maximum-Entropy Classifier Based Text Mining Tool for Transporter-Substrate Identification with Semistructured Text [O] . Min Zhao, Yanming Chen, Dacheng Qu, -1

机译：METSP：基于最大熵分类器的文本挖掘工具用于半结构化文本的转运体-基质识别
7. Testing a text mining tool for emerging risk identification [O] . Niels B. Lucas Luijckx, Fred J. van de Brug, Winfried R. Leeman, 2016

机译：测试新出现风险识别的文本挖掘工具

MegaMiner: A Tool for Lead Identification Through Text Mining Using Chemoinformatics Tools and Cloud Computing Environment

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅