首页> 美国卫生研究院文献>Nature Public Health Emergency Collection >Simple Semantics in Topic Detection and Tracking
【2h】

Simple Semantics in Topic Detection and Tracking

机译:主题检测和跟踪中的简单语义

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Topic Detection and Tracking (TDT) is a research initiative that aims at techniques to organize news documents in terms of news events. We propose a method that incorporates simple semantics into TDT by splitting the term space into groups of terms that have the meaning of the same type. Such a group can be associated with an external ontology. This ontology is used to determine the similarity of two terms in the given group. We extract proper names, locations, temporal expressions and normal terms into distinct sub-vectors of the document representation. Measuring the similarity of two documents is conducted by comparing a pair of their corresponding sub-vectors at a time. We use a simple perceptron to optimize the relative emphasis of each semantic class in the tracking and detection decisions. The results suggest that the spatial and the temporal similarity measures need to be improved. Especially the vagueness of spatial and temporal terms needs to be addressed.
机译:主题检测和跟踪(TDT)是一项研究计划,旨在根据新闻事件组织新闻文档的技术。我们提出了一种通过将术语空间分为具有相同类型含义的术语组来将简单语义纳入TDT的方法。这样的组可以与外部本体相关联。该本体用于确定给定组中两个术语的相似性。我们将专有名称,位置,时间表达和规范术语提取到文档表示形式的不同子向量中。通过一次比较一对相应的子向量来测量两个文档的相似性。我们使用简单的感知器来优化每个语义类在跟踪和检测决策中的相对强调。结果表明,空间和时间的相似性措施有待改进。尤其需要解决时空术语的模糊性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号