【24h】

Automated Article Links Identification for Web-based Online Medical Journals

机译:基于Web的在线医学期刊的自动文章链接识别

获取原文
获取原文并翻译 | 示例

摘要

As part of research into Web-based document analysis including Web page downloading and classification, an algorithm has been developed to automatically identify article links in Web-based online journals. This algorithm is based on feature vectors calculated from attributes and contents of links extracted from HTML files, and an instance-based learning algorithm using a nearest neighbor methodology to identify article links. The performance of the algorithm has been evaluated using a sample size of several thousand HTML links of Web-based medical journals. Evaluation shows that the algorithm is capable of identifying article links at an accuracy greater than 99 %.
机译:作为基于Web的文档分析(包括Web页面下载和分类)研究的一部分,已经开发了一种算法,用于自动识别基于Web的在线期刊中的文章链接。此算法基于从HTML文件中提取的链接的属性和内容计算出的特征向量,以及基于实例的学习算法,该算法使用最近邻居方法来识别文章链接。使用基于Web的医学期刊的数千个HTML链接的样本大小对算法的性能进行了评估。评估表明,该算法能够以大于99%的精度识别商品链接。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号