首页> 外文会议>Focused Access to XML Documents >Entity Ranking from Annotated Text Collections Using Multitype Topic Models

【24h】

Entity Ranking from Annotated Text Collections Using Multitype Topic Models

机译：使用多类型主题模型的带注释文本集合中的实体排名

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Very recently, topic model-based retrieval methods have produced good results using Latent Dirichlet Allocation (LDA) model or its variants in language modeling framework. However, for the task of retrieving annotated documents when using the LDA-based methods, some post-processing is required outside the model in order to make use of multiple word types that are specified by the annotations. In this paper, we explore new retrieval methods using a 'multi-type topic model' that can directly handle multiple word types, such as annotated entities, category labels and other words that are typically used in Wikipedia. We investigate how to effectively apply the multitype topic model to retrieve documents from an annotated collection, and show the effectiveness of our methods through experiments on entity ranking using a Wikipedia collection.

机译：最近，在语言建模框架中使用潜在狄利克雷分配（LDA）模型或其变体，基于主题模型的检索方法已产生了良好的效果。但是，对于使用基于LDA的方法时检索带注释的文档的任务，需要在模型外部进行一些后处理，以便使用由注释指定的多个单词类型。在本文中，我们探索了一种使用“多类型主题模型”的新检索方法，该模型可以直接处理多种单词类型，例如带注释的实体，类别标签和Wikipedia中通常使用的其他单词。我们研究如何有效地使用多类型主题模型来从带注释的集合中检索文档，并通过使用Wikipedia集合进行实体排名实验来证明我们的方法的有效性。

著录项

来源
《Focused Access to XML Documents》|2007年|P.279-292|共14页
会议地点 Dagstuhl Castle(DE);Dagstuhl Castle(DE)
作者
Hitohiro Shiozaki; Koji Eguchi;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类信息处理（信息加工）;
关键词
入库时间 2022-08-26 14:06:49

相似文献

外文文献
中文文献
专利

1. Entity Network Prediction Using Multitype Topic Models [J] . Hitohiro SHIOZAKI, Koji EGUCHI, Takenao OHKAWA IEICE Transactions on Information and Systems . 2008,第11期

机译：使用多类型主题模型的实体网络预测
2. Additive Regularization for Topic Models of Text Collections [J] . K. V. Vorontsov Doklady. Mathematics . 2014,第3期

机译：文本集合主题模型的加性正则化
3. Text Mining For Information Systems Researchers: An Annotated Topic Modeling Tutorial [J] . Stefan Debortoli, Oliver Müller, Iris Junglas, Communications of the Association for Information Systems . 2016,第1期

机译：信息系统研究人员的文本挖掘：带注释的主题建模教程
4. Entity Ranking from Annotated Text Collections Using Multitype Topic Models [C] . Hitohiro Shiozaki, Koji Eguchi International Workshop of the Initiative for the Evaluation of XML Retrieval . 2008

机译：使用MultityPe主题模型从注释的文本集合中排名实体
5. Things and Strings and More: Improving Place Name Disambiguation from Short Texts by Combining Entity Co-Occurrence, Topic Modeling, and Word Embedding [D] . Ju, Yiting. 2017

机译：事物和字符串和更多：通过组合实体共同发生，主题建模和单词嵌入来改善从短文本的歧义
6. MTQA: Text-Based Multitype Question and Answer Reading Comprehension Model [O] . Deguang Chen, Ziping Ma, Lin Wei, 2021

机译：MTQA：基于文本的多立方问题和应答阅读理解模型
7. Block-LDA: Jointly modeling entity-annotated text and entity-entity links [O] . Ramnath Balasubramanyan, William W. Cohen 2011

机译：Block-LDa：联合建模实体注释文本和实体 - 实体链接

Entity Ranking from Annotated Text Collections Using Multitype Topic Models

摘要

著录项

相似文献

相关主题

期刊订阅