首页> 外文学位 >Entity Extraction and Disambiguation in Short Text Using Wikipedia and Semantic User Profiles.
【24h】

Entity Extraction and Disambiguation in Short Text Using Wikipedia and Semantic User Profiles.

机译:使用Wikipedia和语义用户配置文件在短文本中提取和消除歧义。

获取原文
获取原文并翻译 | 示例

摘要

We focus on entity extraction and disambiguation in short text communications, which have experienced some advances in the last decade, but to this day remain very challenging. Much of the research that has helped advance the field has leveraged crowd-sourced, external knowledge bases like Wikipedia to build probabilistic and machine learning models for entity extraction. That work has its basis in Wikify! and has recently been applied to understanding the topics discussed on social media where a terse, lossy form of communication makes topic detection even more challenging. We expand on this work and show that on the Twitter data experiments we conducted that leveraging a rich, semantic history of entities that users discuss can improve the accuracy of semantically annotating their future social media posts.
机译:我们专注于短文本通信中的实体提取和消歧,在过去的十年中已经取得了一些进展,但是直到今天仍然非常具有挑战性。推动该领域发展的许多研究都利用了众包的外部知识库(如Wikipedia)来建立用于实体提取的概率和机器学习模型。这项工作在Wikify中有其基础!并且最近已被用于理解社交媒体上讨论的主题,而简短,有损的交流形式使主题检测变得更具挑战性。我们对这项工作进行了扩展,并表明在Twitter数据实验中,我们进行了充分利用用户讨论的实体的丰富语义历史记录,可以提高语义注释其未来社交媒体帖子的准确性。

著录项

  • 作者

    Zendejas, Ignacio.;

  • 作者单位

    University of California, Los Angeles.;

  • 授予单位 University of California, Los Angeles.;
  • 学科 Computer Science.
  • 学位 M.S.
  • 年度 2014
  • 页码 66 p.
  • 总页数 66
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:53:38

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号