Dealing with highly imbalanced textual data gathered into similar classes

Abstract

This paper deals with a new feature selection and feature contrasting approach for classification of highly imbalanced textual data with a high degree of similarity between associated classes. An example of such classification context is illustrated by the task of classifying bibliographic references into a patent classification scheme. This task represents one of the domains of investigation of the QUAERO project, with the final goal of helping experts to evaluate upcoming patents through the use of related research.

Bibliographic record

Source: International Joint Conference on Neural Networks, 2013, pp. 1-7 (7 pages)
Author: Lamirel Jean-Charles
Original format: PDF

Similar documents

1. Multi-view learning-based data proliferator for boosting classification using highly imbalanced classes [J]. Graa Olfa, Rekik Islem. Journal of Neuroscience Methods. 2019.
2. Resampling imbalanced data to detect fake reviews using machine learning classifiers and textual-based features [J]. Budhi Gregorius Satia, Chiong Raymond, Wang Zuli. Multimedia Tools and Applications. 2021, no. 9.
3. Ensemble learning by means of a multi-objective optimization design approach for dealing with imbalanced data sets [J]. Ribeiro Victor Henrique Alves, Reynoso-Meza Gilberto. Expert Systems with Applications. 2020.
4. Dealing with highly imbalanced textual data gathered into similar classes [C]. Lamirel Jean-Charles. International Joint Conference on Neural Networks. 2013.
5. Improving Real-Time Intrusion Detection in Dynamic Networks with Highly Imbalanced Data Using a Multi-Agent Architecture Approach [D]. Rucker, Anh-Hong Nguyen. 2019.
6. Comparison between Statistical Models and Machine Learning Methods on Classification for Highly Imbalanced Multiclass Kidney Data [O]. Bomi Jeong, Hyunjeong Cho, Jieun Kim. 2020.
7. Table 1: Summary of initial data downloaded from each of four biodiversity data portals for Mexican vertebrate classes, and the relative redundancy of records in each, at the level of species × time (year, month, day) × place (geographic coordinates, textual descriptions) [O].
