Proceedings of the 73rd ASIS&T Annual Meeting: Navigating Streams in an Information Ecosystem

A Comparative Analysis of User-generated and Author-generated Metadata for Web Resources

Abstract

In this paper, we investigate the difference between metadata generated by users and metadata generated by authors. Delicious tags and HTML keyword META tags associated with the same set of web pages on topics related to the Semantic Web are collected, forming two datasets (the Delicious dataset and the HTML dataset). The two datasets are compared in terms of micro and macro vocabulary overlap and of the classification of web pages. The results show that (1) the two datasets overlap; (2) non-overlapping tags in the Delicious dataset reveal a systematic deficiency of social tagging systems, while non-overlapping tags in the HTML dataset expose organization-oriented content; and (3) the Delicious dataset tends to cluster web pages according to their popularity and subject area, whereas the HTML dataset clusters web pages according to website or author.
