Entity-Level Stream Classification: Exploiting Entity Similarity to Label the Future Observations Referring to an Entity

机译：实体级流分类：利用实体相似性来标记引用实体的未来观察

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Stream classification algorithms traditionally treat arriving observations as independent. However, in many applications the arriving examples may depend on the "entity" that generated them, e.g. in product reviewing or in the interactions of users with an application server. In this study, we investigate the potential of this dependency by partitioning the original stream of observations into entity-centric substreams and by incorporating entity-specific information into the learning model. We propose a k Nearest Neighbour inspired stream classification approach (kNN), in which the label of an arriving observation is predicted by exploiting knowledge on the observations belonging to this entity and to entities similar to it. For the computation of entity similarity, we consider knowledge about the observations and knowledge about the entity, potentially transferred from another domain. To distinguish between cases where this kind of knowledge transfer is beneficial for stream classification and cases where the knowledge on the entities does not contribute to classifying the observations, we also propose a heuristic approach based on random sampling of substreams using k Random Entities (kRE). Our learning scenario is not fully supervised: after acquiring labels for the initial few observations of each entity, we assume that no additional labels arrive, and attempt to predict the labels of near-future and far-future observations from that initial seed. We report on our findings from three datasets.

机译：传统上，流分类算法将到达的观测视为独立的。但是，在许多应用中，到达的示例可能取决于产生它们的“实体”，例如在产品审查中或在用户与应用程序服务器的交互中。在这项研究中，我们通过将原始观察流划分为以实体为中心的子流并将特定于实体的信息合并到学习模型中来研究这种依赖性的可能性。我们提出了一种k最近邻启发式流分类方法（kNN），其中通过利用对属于该实体以及与之相似的实体的观测的知识来预测到达的观测的标签。为了计算实体相似度，我们考虑了有关观测的知识和有关实体的知识，这些知识可能是从另一个领域转移过来的。为了区分这种知识转移有利于流分类的情况和有关实体的知识无助于对观察结果进行分类的情况，我们还提出了一种启发式方法，该方法基于k个随机实体（kRE）对子流进行随机采样。我们的学习场景并未得到完全监督：在获取每个实体的最初几个观察值的标签之后，我们假设没有其他标签到达，并尝试从该初始种子中预测近期和远期观察的标签。我们报告来自三个数据集的发现。

著录项

来源
《IEEE International Conference on Data Science and Advanced Analytics》|2018年|246-255|共10页
会议地点
作者
Vishnu Unnikrishnan; Christian Beyer; Pawel Matuszyk; Uli Niemann; Rüdiger Pryss; Winfried Schlee; Eirini Ntoutsi; Myra Spiliopoulou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data models; Prediction algorithms; Task analysis; Deep learning; Predictive models; Computational modeling; Diabetes;

机译：数据模型;预测算法;任务分析;深度学习;预测模型;计算模型;糖尿病;

相似文献

外文文献
中文文献
专利

1. Entity-level stream classification: exploiting entity similarity to label the future observations referring to an entity [J] . Vishnu Unnikrishnan, Christian Beyer, Pawel Matuszyk, International Journal of Data Science and Analytics . 2020,第1期

机译：实体级流分类：利用实体相似性来标记涉及实体的未来观察结果
2. Entity-Sensitive Attention and Fusion Network for Entity-Level Multimodal Sentiment Classification [J] . Jianfei Yu, Jing Jiang, Rui Xia Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：实体敏感的关注和实体级多模式情绪分类的融合网络
3. Entity-Level Classification of Adverse Drug Reaction: A Comparative Analysis of Neural Network Models [J] . Alimova I. S., Tutubalina E. V Programming and Computer Software . 2019,第8期

机译：不良药物反应的实体水平分类：神经网络模型的比较分析
4. Entity-Level Stream Classification: Exploiting Entity Similarity to Label the Future Observations Referring to an Entity [C] . Vishnu Unnikrishnan, Christian Beyer, Pawel Matuszyk, IEEE International Conference on Data Science and Advanced Analytics . 2019

机译：实体级流分类：利用实体相似性以标记指示实体的未来观察
5. Task mapping and remapping strategies for parallel entity-level simulations. [D] . Su, Alan I. 2003

机译：并行实体级模拟的任务映射和重新映射策略。
6. DTranNER: biomedical named entity recognition with deep learning-based label-label transition model [O] . S. K. Hong, Jae-Gil Lee 2020

机译：DTranNER：具有基于深度学习的标签-标签转换模型的生物医学命名实体识别
7. Using Character-Level and Entity-Level Representations to Enhance Bidirectional Encoder Representation From Transformers-Based Clinical Semantic Textual Similarity Model: ClinicalSTS Modeling Study (Preprint) [O] . Ying Xiong, Shuai Chen, Qingcai Chen, 2020

机译：使用字符级和实体级别表示来增强基于变压器的临床语义文本相似性模型的双向编码器表示：临床电脑建模研究（预印）
8. Topical Acne Drug Products for Over-the-Counter Human Use Revision of Labeling and Classification of Benzoyl Peroxide as Safe and Effective Small Entity Compliance Guide. Guidance for Industry. [R] . 2011

机译：用于非处方人用的局部痤疮药物产品修订过氧化苯甲酰的标签和分类为安全有效的小实体合规指南。工业指南。

Entity-Level Stream Classification: Exploiting Entity Similarity to Label the Future Observations Referring to an Entity

摘要

著录项

相似文献

相关主题

期刊订阅