A Mixture Language Model for Class-Attribute Mining from Biomedical Literature Digital Library

机译：生物医学文献数字图书馆类属性挖掘的混合语言模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We define and study a novel text mining problem for biomedical literature digital library, referred to as the class-attribute mining. Given a collection of biomedical literature from a digital library addressing a set of objects (e.g., proteins) and their descriptions (e.g., protein functions), the tasks of class-attribute mining include: (1) to identify and summarize latent classes in the space of objects, (2) to discover latent attribute themes in the space of object descriptions, and (3) to summarize the commonalities and differences among identified classes along each attribute theme. We approach this mining problem through a mixture language model and estimate the parameters of the model using the EM algorithm. We demonstrate the effectiveness of the model with an application called protein community identification and annotation from Medline, the largest biomedical literature digital library with more than 16 millions abstracts.

机译：我们定义并研究生物医学文献数字图书馆的新型文本挖掘问题，称为类属性挖掘。给定来自一个关于一组对象（例如，蛋白质）的数字图书馆的生物医学文献及其描述（例如，蛋白质函数），类属性挖掘的任务包括：（1）以识别和总结潜在的潜在课程对象的空间，（2）在对象描述的空间中发现潜在的属性主题，（3）总结每个属性主题的识别类之间的共性和差异。我们通过混合语言模型来处理该挖掘问题，并使用EM算法估算模型的参数。我们证明了模型的有效性与蛋白质社区识别和来自Medline的注释，最大的生物医学文献数字图书馆具有超过16000毫升的摘要。

著录项

来源
《IEEE International Conference on Bioinformatics and Biomedicine Workshops》|2007年||共9页
会议地点
作者
Xiaohua Zhou; Xiaohua Hu; Xiaodan Zhang; Daniel D. Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 G202-53;
关键词
入库时间 2022-08-21 01:59:39

相似文献

外文文献
中文文献
专利

1. Digital library of required classical literature for elementary and secondary school curricula in domestic languages of Bosnia and Herzegovina [J] . Kanita Besirevic OCLC Systems and Services . 2020,第3期

机译：博斯尼亚和黑塞哥维那国内语言学院和中学课程所需古典文学的数字图书馆
2. Integrating unified medical language system and association mining techniques into relevance feedback for biomedical literature search [J] . Yanqing Ji, Hao Ying, John Tran, BMC Bioinformatics . 2016,第9期

机译：将统一的医学语言系统和关联挖掘技术集成到相关反馈中，以进行生物医学文献搜索
3. Mining novel connections from large online digital library using biomedical ontologies [J] . Xiaohua Hu Library management . 2005,第4a5期

机译：使用生物医学本体从大型在线数字图书馆中挖掘新颖的联系
4. A Mixture Language Model for Class-Attribute Mining from Biomedical Literature Digital Library [C] . Zhou Xiaohua, Hu Xiaohua, Zhang Xiaodan, IEEE International Conference on Bioinformatics and Biomedicine . 2008

机译：生物医学文献数字图书馆类属性挖掘的混合语言模型
5. Cluster-based Query Expansion Using Language Modeling for Biomedical Literature Retrieval. [D] . Xu, Xuheng. 2011

机译：用于生物医学文献检索的使用语言建模的基于聚类的查询扩展。
6. Integrating unified medical language system and association mining techniques into relevance feedback for biomedical literature search [O] . Yanqing Ji, Hao Ying, John Tran, 2016

机译：将统一的医学语言系统和关联挖掘技术集成到相关反馈中以进行生物医学文献搜索
7. UPH Digital Library Miner: A Topic Modelling-based Software Application for Mining Document Collections of a Digital Library [O] . Toluwase A., Ayodeji I., Ifeanyi C. 2015

机译：UPH数字图书馆矿工：用于数字图书馆的挖掘文件集合的基于主题建模的软件应用程序
8. Use of Permanent Paper for Biomedical Literature. Summary of the Proceedings of the National Library of Medicine Board of Regents Hearing (January 27, 1987) [R] . Kalina, C. R. 1987

机译：生物医学文献永久性纸张的使用。国家医学图书馆会议记录委员会摘要（1987年1月27日）

A Mixture Language Model for Class-Attribute Mining from Biomedical Literature Digital Library

摘要

著录项

相似文献

相关主题

期刊订阅