首页> 外文会议>International Conference on Text,Speech and Dialogue(TSD 2004); 20040908-11; Brno(CZ) >Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts
【24h】

Balancing Manual and Automatic Indexing for Retrieval of Paper Abstracts

机译:平衡论文摘要的手动和自动索引编制

获取原文
获取原文并翻译 | 示例

摘要

MEDLINE is a widely used very large database of abstracts of research papers in medical domain. Abstracts in it are manually supplied with keywords from a controlled vocabulary called MeSH. The MeSH keywords assigned to a specific document are subdivided into MeSH major headings, which express the main topic of the document, and MeSH minor headings, which express additional information about the document's topic. The search engine supplied with MEDLINE uses Boolean retrieval model with only MeSH keywords used for indexing. We show that (1) vector space retrieval model with the full text of the abstracts indexed gives much better results; (2) assigning greater weights to the MeSH keywords than to the terms appearing in the text of the abstracts gives slightly better results, and (3) assigning slightly greater weight to major MeSH terms than to minor MeSH terms further improves the results.
机译:MEDLINE是医学领域广泛使用的非常庞大的研究论文摘要数据库。其中的摘要由称为MeSH的受控词汇中的关键字手动提供。分配给特定文档的MeSH关键字可细分为MeSH主标题(表示文档的主要主题)和MeSH次标题(表示有关文档主题的其他信息)。 MEDLINE随附的搜索引擎使用布尔检索模型,仅将MeSH关键字用于索引。我们证明了:(1)带有摘要全文的向量空间检索模型给出了更好的结果; (2)为MeSH关键字赋予比在摘要文本中出现的术语更大的权重,会得到更好的结果;(3)为主要的MeSH术语赋予的权重比对次要的MeSH术语更大的权重会进一步改善结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号