首页> 外文会议>International Congress on Digital Heritage >News Search Using Discourse Analytics
【24h】

News Search Using Discourse Analytics

机译:新闻搜索使用话语分析

获取原文

摘要

The vast numbers of digitised documents containing historical data constitute a rich research data repository. However, computational methods and tools available to explore this data are still limited in functionality. Research on historical archives is still largely carried out manually. Text mining technologies offer novel methods to analyse digital content to identify various types of semantic information in these documents and to extract them as semantic metadata. Methods range from the automatic identification of named entities (e.g., people, places, organisations, etc.) to more sophisticated methods to extract information about events (e.g., births, deaths, arrests, etc.), allowing users to greatly increase the specificity of their search. We have created an extended model of event interpretation to allow searches to be refined based on various discourse facets, including isolating definite information about events from more speculative details, distinguishing positive and negative opinions and categorising events according to information source. We present ISHER as an example of a multifaceted, semantically oriented system for searching news articles from the New York Times, dating back to 1987. We explain how our extended event interpretation model can enhance search capabilities in systems such as ISHER, including the identification of contrasting and contradictory information in news articles.
机译:包含历史数据的大量数字化文档构成了丰富的研究数据存储库。但是,可用于探索此数据的计算方法和工具仍然有限于功能。历史档案的研究仍然在很大程度上手动进行。文本挖掘技术提供了分析数字内容的新方法,以确定这些文档中的各种类型的语义信息,并将其提取为语义元数据。方法范围从自动识别指定实体(例如,人,地方,组织等)到更复杂的方法,以提取有关事件的信息(例如,出生,死亡,逮捕等),使用户能够大大增加特殊性他们的搜索。我们已经创建了一个扩展的事件解释模型,以允许根据各种话语方面进行精制搜索,包括从更多投机细节中隔离有关事件的明确信息,包括根据信息来源的正面和负面意见和分类事件。我们将Isher作为一个多方面,语义面向系统的示例,用于从纽约时报搜索新闻文章,约会回到1987年。我们解释了我们的扩展事件解释模型如何增强诸如ISHer的系统中的搜索能力,包括识别新闻文章中的对比和矛盾信息。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号