NELasso: Group-Sparse Modeling for Characterizing Relations Among Named Entities in News Articles

Amara Tariq; Asim Karim; Hassan Foroosh

Abstract

Named entities such as people, locations, and organizations play a vital role in characterizing online content. They often reflect information of interest and are frequently used in search queries. Although named entities can be detected reliably from textual content, extracting relations among them is more challenging, yet useful in various applications (e.g., news recommending systems). In this paper, we present a novel model and system for learning semantic relations among named entities from collections of news articles. We model each named entity occurrence with sparse structured logistic regression, and consider the words (predictors) to be grouped based on background semantics. This sparse group LASSO approach forces the weights of word groups that do not influence the prediction towards zero. The resulting sparse structure is utilized for defining the type and strength of relations. Our unsupervised system yields a named entities' network where each relation is typed, quantified, and characterized in context. These relations are the key to understanding news material over time and customizing newsfeeds for readers. Extensive evaluation of our system on articles from TIME magazine and BBC News shows that the learned relations correlate with static semantic relatedness measures like WLM, and capture the evolving relationships among named entities over time.
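The sparse group LASSO idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the solver (plain proximal gradient descent), the function names, the penalty weights, the square-root group weighting, and the toy data are all assumptions made for illustration. Each row stands for one article, the label marks whether a given named entity occurs in it, and the columns are word features split into two hypothetical semantic groups.

```python
import math

def soft_threshold(x, t):
    """Lasso shrinkage: move x toward zero by t, clipping at zero."""
    return math.copysign(max(abs(x) - t, 0.0), x)

def prox_sparse_group_lasso(weights, groups, lam_l1, lam_group, step):
    """Proximal step for lam_l1*||w||_1 + lam_group*sum_g sqrt(|g|)*||w_g||_2.

    Elementwise soft-thresholding is followed by group-wise shrinkage,
    which sets a whole group to zero when its norm is below the threshold.
    """
    out = list(weights)
    for idx in groups:
        shrunk = [soft_threshold(weights[i], step * lam_l1) for i in idx]
        norm = math.sqrt(sum(v * v for v in shrunk))
        thresh = step * lam_group * math.sqrt(len(idx))
        scale = max(0.0, 1.0 - thresh / norm) if norm > 0 else 0.0
        for i, v in zip(idx, shrunk):
            out[i] = scale * v
    return out

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_entity_model(X, y, groups, lam_l1=0.01, lam_group=0.05,
                     step=0.5, iters=300):
    """Logistic regression with a sparse group penalty, via proximal gradient."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(iters):
        grad = [0.0] * d
        for row, label in zip(X, y):
            err = sigmoid(sum(wi * xi for wi, xi in zip(w, row))) - label
            for j in range(d):
                grad[j] += err * row[j] / n
        moved = [wi - step * g for wi, g in zip(w, grad)]
        w = prox_sparse_group_lasso(moved, groups, lam_l1, lam_group, step)
    return w

# Toy data (hypothetical): columns 0-1 form a word group that tracks the
# entity label (centered to +1/-1); columns 2-3 form a group that appears
# equally often with and without the entity.
X = [[1, 1, 1, 0], [1, 1, 0, 1], [-1, -1, 1, 0], [-1, -1, 0, 1]]
y = [1, 1, 0, 0]
groups = [[0, 1], [2, 3]]
w = fit_entity_model(X, y, groups)
print(w)
```

In this toy setup the uninformative group is driven to exactly zero while the informative group keeps nonzero weights, which is the kind of sparse structure the abstract describes as the basis for typing and quantifying relations.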

Bibliographic details

Source
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, No. 10, pp. 2000-2014 (15 pages)
Authors
Amara Tariq; Asim Karim; Hassan Foroosh
Author affiliations

Department of Computer Science, Forman Christian College, Pakistan;

Department of Information Technology, Lahore University of Management Sciences, Lahore, Pakistan;

Department of Computer Science, University of Central Florida, Orlando, FL;

Original format: PDF
Language: English
Keywords
Semantics; Context; Encyclopedias; Electronic publishing; Internet; Vocabulary
Date added: 2022-08-17 13:38:42

Similar literature

1. Named Entity Oriented Difference Analysis of News Articles and Its Application [J]. Keisuke KIRITOSHI, Qiang MA. IEICE Transactions on Information and Systems. 2016, No. 4
2. Automatic discovery of person-related named-entity in news articles based on verb analysis [J]. Goh Hui-Ngo, Soon Lay-Ki, Haw Su-Cheng. Multimedia Tools and Applications. 2015, No. 8
3. A Joint Neural Model for Fine-Grained Named Entity Classification of Wikipedia Articles [J]. Masatoshi SUZUKI, Koji MATSUDA, Satoshi SEKINE. IEICE Transactions on Information and Systems. 2018, No. 1
4. Comparison of Named Entity Recognition Tools Applied to News Articles [C]. Sergey Vychegzhanin, Evgeny Kotelnikov. Ivannikov ISPRAS Open Conference. 2019
5. Learning for information extraction: From named entity recognition and disambiguation to relation extraction [D]. Bunescu, Razvan Constantin. 2007
6. A method for named entity normalization in biomedical articles: application to diseases and plants [O]. Hyejin Cho, Wonjun Choi, Hyunju Lee. 2017
7. Named Entity Oriented Difference Analysis of News Articles and Its Application [O]. KIRITOSHI, Keisuke, MA, Qiang. 2016
