首页> 外文会议>Workshop on biomedical natural language processing >Recognizing sublanguages in scientific journal articles through closure properties
【24h】

Recognizing sublanguages in scientific journal articles through closure properties

机译:通过封闭属性识别科学期刊文章中的亚语言

获取原文

摘要

It has long been realized that sublanguages are relevant to natural language processing and text mining. However, practical methods for recognizing or characterizing them have been lacking. This paper describes a publicly available set of tools for sublanguage recognition. Closure properties are used to assess the goodness of fit of two biomedical corpora to the sublanguage model. Scientific journal articles are compared to general English text, and it is shown that the journal articles fit the sublanguage model, while the general English text does not. A number of examples of implications of the sublanguage characteristics for natural language processing are pointed out. The software is made publicly available at [edited for anonymization].
机译:它已经很久意识到,子语言与自然语言处理和文本挖掘有关。但是,缺乏识别或表征它们的实用方法。本文介绍了用于子语识别的公开组织工具集。封闭性能用于评估两个生物医学Corpora对子宫内模型的良好性。将科学期刊文章与一般英语文本进行比较,并显示期刊文章符合子语言模型,而一般英语文本则不会。指出了自然语言处理的子语言特征的许多含义的示例。该软件在[编辑匿名化]中公开可用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号