【24h】

Blogger, stick to your story

机译:博主,坚持你的故事

获取原文

摘要

Noise in text can be defined as any kind of difference between the surface form of a coded representation of the text and the intended, correct, or original text. By its very nature, noisy text warrants moving beyond traditional text analytics techniques. Noise introduces challenges that need special handling, either through new methods or improved versions of existing ones. After the highly successful AND 2007, that was part of IJCAI 07, in this second edition that is part of SIGIR 08, the Information Retrieval community has added its perspective to this topic. The goal of the AND workshops is to focus on the problems encountered in analyzing noisy documents coming from various sources. This workshop brought together a diverse group of researchers to present current research and development in addressing this challenge. >We were fortunate to assemble a diverse group of researchers from the Natural Language Processing, Machine Learning and Knowledge Management communities to help usin organizing this workshop. The workshop call for papers had a very good response. We received 25 submissions spanning a diverse set of issues relevant to noisy text analytics. Each submission was reviewed by at least three members of the program committee. Finally twelve papers were selected for oral and four for poster presentation. >To encourage discussion, the workshop program was structured into topic-oriented oral and poster sessions. In addition to the contributed papers, the program also contained a keynote address by Donna Harman, NIST, an invited talk by John Tait, IRF, and discussion sessions spread through the day.
机译:文本中的噪声可以定义为文本编码表示的表面形式和预期,正确或原始文本之间的表面形式之间的任何类型。通过其本质,嘈杂的文本认证超越传统文本分析技术。噪音引入了需要特殊处理的挑战,通过新方法或改进现有版本。在高度成功和2007年之后,这是IJCAI 07的一部分,在这第二版是Sigir 08的一部分,信息检索社区已向这一主题添加了它的观点。和研讨会的目标是专注于分析来自各种来源的嘈杂文件的问题。该研讨会汇集了各种各样的研究人员,在解决这一挑战方面提出了当前的研究和发展。 >我们很幸运能够从自然语言加工,机器学习和知识管理社区组装多样化的研究人员帮助我们组织这个研讨会。研讨会呼吁论文有一个非常好的回应。我们收到了25份涵盖了与嘈杂的文本分析相关的多样化问题。每次提交都由计划委员会的至少三名成员审查。最后选择了12篇论文的口头和四个海报演示文稿。为了鼓励讨论,研讨会计划被构建为面向主题的口头和海报会议。除了贡献的论文之外,该计划还包含了Donna Harman,NIST的主题演讲,由John Tait,IRF,讨论会参加当天的讨论会议。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号