Exploring Active Learning Based on Representativeness and Uncertainty for Biomedical Data Classification

Bressan Rafael S.; Camargo Guilherme; Bugatti Pedro Henrique; Saito Priscila Tiemi Maeda

首页> 外文期刊>Biomedical and Health Informatics, IEEE Journal of >Exploring Active Learning Based on Representativeness and Uncertainty for Biomedical Data Classification

【24h】

Exploring Active Learning Based on Representativeness and Uncertainty for Biomedical Data Classification

机译：基于代表性和不确定性的主动学习对生物医学数据分类的探索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays, there is an abundance of biomedical data, such as images and genetic sequences, among others. However, there is a lack of annotation to such volume of data, due to the high costs involved to perform this task. Thus, it is mandatory to develop techniques to ease the burden of human annotation. To reach such goal active learning strategies can be applied. However, the state-of-the-art active learning methods, generally, are not feasible to lead with real-world datasets. Another important issue, that is generally neglected by these methods, is related to the conception that the classifier tends to learn more and more at each iteration. Their adopted selection criteria do not properly exploit the knowledge of the classifier. Therefore, in this paper, we propose the use of an active learning approach, in order to leverage the learning process, including the proposal of a novel active learning strategy. The main difference of our proposed strategy is related to the participation of the classifier in an extremely active way in its learning process. So, we can better maximize and prioritize the knowledge that is obtained by the classifier at each iteration, making use of this knowledge in a more appropriate and useful way when selecting more informative samples. To do so, in our selection criteria, we give significant importance to the classifications suggested by the classifier. In addition, jointly with the participation and the knowledge of the classifier, we consider both uncertainty and representativeness criteria through a fine-grained analysis of the samples. Experimental results show that our novel active learning approach outperforms state-of-the-art active learning methods, considering several supervised classifiers. Hence, dealing with real dataset problems in a better way, equalizing the tradeoff between annotation task and higher accuracy rates.

机译：如今，有大量的生物医学数据，例如图像和遗传序列等。但是，由于执行此任务的成本较高，因此缺少对此类数据的注释。因此，必须开发减轻人类注释负担的技术。为了达到这样的目标，可以采用主动学习策略。但是，通常，采用最新的主动学习方法来引导现实世界的数据集是不可行的。这些方法通常忽略的另一个重要问题与分类器倾向于在每次迭代中学习越来越多的概念有关。他们采用的选择标准不能正确利用分类器的知识。因此，在本文中，我们建议使用主动学习方法，以利用学习过程，包括提出一种新颖的主动学习策略。我们提出的策略的主要区别在于分类器以一种非常积极的方式参与其学习过程。因此，我们可以更好地最大化和优先化分类器在每次迭代中获得的知识，并在选择更多信息样本时以更适当和有用的方式利用这些知识。为此，在我们的选择标准中，我们非常重视分类器建议的分类。此外，结合分类器的参与和知识，我们通过对样本进行细粒度分析来考虑不确定性和代表性标准。实验结果表明，考虑到多个监督分类器，我们新颖的主动学习方法优于最新的主动学习方法。因此，以更好的方式处理实际数据集问题，使注释任务与更高的准确率之间的权衡平衡。

著录项

来源
《Biomedical and Health Informatics, IEEE Journal of》 |2019年第6期|2238-2244|共7页
作者
Bressan Rafael S.; Camargo Guilherme; Bugatti Pedro Henrique; Saito Priscila Tiemi Maeda;
展开▼
作者单位

Fed Univ Technol Dept Comp BR-86300000 Comelio Procopio Brazil;

Fed Univ Technol Dept Comp BR-86300000 Comelio Procopio Brazil|Univ Estadual Campinas Inst Comp BR-13083970 Campinas SP Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Uncertainty; Informatics; Learning systems; Support vector machines; Bioinformatics; Training; Task analysis; Biomedical datasets; healthcare; medical diagnosis; data analysis; knowledge discovery; machine learning;

机译：不确定;信息学学习系统;支持向量机;生物信息学训练;任务分析;生物医学数据集;卫生保健;医学诊断;数据分析;知识发现;机器学习;
入库时间 2022-08-18 04:49:08

相似文献

外文文献
中文文献
专利

1. Ensemble Learning with Active Example Selection for Imbalanced Biomedical Data Classification [J] . Oh Sangyoon, Lee Min Su, Zhang Byoung-Tak Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2011,第2期

机译：集成学习与有效示例选择，实现不平衡生物医学数据分类
2. Jointly Informative and Manifold Structure Representative Sampling Based Active Learning for Remote Sensing Image Classification [J] . Alim Samat, Paolo Gamba, Sicong Liu, IEEE Transactions on Geoscience and Remote Sensing . 2016,第11期

机译：基于联合信息和流形结构代表采样的主动学习的遥感图像分类
3. Uncertainty sampling-based active learning for protein-protein interaction extraction from biomedical literature [J] . Baojin Cui, Hongfei Lin, Zhihao Yang Expert systems with applications . 2009,第7期

机译：基于不确定性采样的主动学习，用于从生物医学文献中提取蛋白质-蛋白质相互作用
4. Representative Region Based Active Learning For Histological Classification Of Colorectal Cancer [C] . Yiqing Shen, Jing Ke IEEE International Symposium on Biomedical Imaging . 2021

机译：基于代表性区域的直肠癌组织学分类的主动学习
5. Exploring the Roles of High School Career and Technical Education Students' Background Characteristics and College Students' Work-Based Learning Experience on Postsecondary Education Outcomes Using Nationally Representative Data [D] . Benjamin, Courtney B. 2018

机译：利用全国代表性的数据探索高中职业技术教育学生的背景特征和大学生基于工作的学习经验对中学教育成果的作用
6. An Active Learning Approach with Uncertainty Representativeness and Diversity [O] . Tianxu He, Shukui Zhang, Jie Xin, -1

机译：具有不确定性代表性和多样性的主动学习方法
7. Ensemble Learning with Active Example Selection for Imbalanced Biomedical Data Classification [O] . Sangyoon Oh, Min Su Lee, Byoung-tak Zhang 2010

机译：集成学习与有效示例选择，实现不平衡生物医学数据分类

Exploring Active Learning Based on Representativeness and Uncertainty for Biomedical Data Classification

摘要

著录项

相似文献

相关主题

期刊订阅