首页> 外文会议>2017 Intelligent Systems Conference >Improving open-domain event schema discovery with casual english normalization for noisy text
【24h】

Improving open-domain event schema discovery with casual english normalization for noisy text

机译:通过对嘈杂的文本进行随意的英语标准化来改善开放域事件模式的发现

获取原文
获取原文并翻译 | 示例

摘要

Social media enable people to share significant events from their daily life. Social data mining evolves the challenge of dealing with casual language extraction due to the unstructured social media content: social media users often prefer communicating unconventionally with informal language using abbreviations, slang, misspelled words, or non-standard short-forms. Thereby, this paper proposes a new open-domain event schema discovery approach using casual language normalization to normalize, extract events and discover their adequate schemas (event types and argument roles) from noisy corpus. The proposed approach exploits casual language normalization to improve both tasks of event schema discovery and event extraction. This approach can automatically normalize and generate high-quality schemas from the extracted events with unknown types. The introduced approach promises better results in terms of accuracy and quality of the discovered schemas.
机译:社交媒体使人们可以分享日常生活中的重大事件。社交数据挖掘由于结构化的社交媒体内容而带来了处理休闲语言提取的挑战:社交媒体用户通常更喜欢使用缩写,语,拼写错误的单词或非标准的简短形式与非正式语言进行非常规的交流。因此,本文提出了一种新的开放域事件模式发现方法,该方法使用休闲语言规范化来规范,提取事件并从嘈杂的语料库中发现事件的适当模式(事件类型和参数角色)。所提出的方法利用休闲语言规范化来改善事件模式发现和事件提取的任务。这种方法可以从提取的未知类型事件中自动规范化并生成高质量模式。引入的方法有望在发现的模式的准确性和质量方面带来更好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号