【24h】

The Aboutness of Words

机译:话语的亲切感

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Word aboutness is defined as the relationship between words and subjects associated with them. An aboutness coefficient is developed to estimate the strength of the aboutness relationship. Words that are randomly distributed across subjects are assumed to lack aboutness and the degree to which their usage deviates from a random pattern indicates the strength of the aboutness. To estimate aboutness, title words and their associated subjects are extracted from the titles of non-fiction English language books in the OCLC WorldCat database. The usage patterns of the title words are analyzed and used to compute aboutness coefficients for each of the common title words. Words with low aboutness coefficients (An and In) are commonly found in stop word lists, whereas words with high aboutness coefficients (Carbonate, Autism) are unambiguous and have a strong subject association. The aboutness coefficient potentially can enhance indexing, advance authority control, and improve retrieval.
机译:单词相关性定义为单词与与其相关联的主题之间的关系。发展了亲近度系数以估计亲近度关系的强度。假定在主题之间随机分布的单词缺乏关联性,并且其用法偏离随机模式的程度表明了关联性的强度。为了估计相关性,从OCLC WorldCat数据库中非小说类英语书籍的标题中提取标题词及其相关主题。分析标题词的使用模式,并将其用于计算每个常见标题词的接近度系数。关联系数低的单词(An和In)通常在停用词列表中找到,而关联系数高的单词(碳酸盐,自闭症)则毫不含糊且主题关联性强。关联系数可能会增强索引编制,推进权限控制并改善检索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号