首页> 外文期刊>Journal of documentation >Automated classification of web pages in hierarchical browsing
【24h】

Automated classification of web pages in hierarchical browsing

机译:分层浏览中的网页自动分类

获取原文
获取原文并翻译 | 示例
           

摘要

Purpose - The purpose of this study is twofold: to investigate whether it is meaningful to use thernEngineering Index (Ei) classification scheme for browsing, and then, if proven useful, to investigate thernperformance of an automated classification algorithm based on the Ei classification scheme.rnDesign/methodology/approach - A user study was conducted in which users solved fourrncontrolled searching tasks. The users browsed the Ei classification scheme in order to examine thernsuitability of the classification systems for browsing. The classification algorithm was evaluated byrnthe users who judged the correctness of the automatically assigned classes.rnFindings - The study showed that the Ei classification scheme is suited for browsing. Automaticallyrnassigned classes were on average partly correct, with some classes working better than others. Successrnof browsing showed to be correlated and dependent on classification correctness.rnResearch limitations/implications - Further research should address problems of disparaternevaluations of one and the same web page. Additional reasons behind browsing failures in the Eirnclassification scheme also need further investigation.rnPractical implications - Improvements for browsing were identified; describing class captionsrnand/or listing their subclasses from start; allowing for searching for words from class captions withrnsynonym search (easily provided for Ei since the classes are mapped to thesauri terms); whenrnsearching for class captions, returning the hierarchical tree expanded around the class in whichrncaption the search term is found. The need for improvements of classification schemes was alsornindicated.rnOriginality/value - A user-based evaluation of automated subject classification in the context ofrnbrowsing has not been conducted before; hence the study also presents new findings concerningrnmethodology.
机译:目的-这项研究的目的是双重的:研究使用工程索引(Ei)分类方案进行浏览是否有意义,然后,如果证明有用,则研究基于Ei分类方案的自动化分类算法的性能。设计/方法/方法-进行了一项用户研究,其中用户解决了四种受控的搜索任务。用户浏览了Ei分类方案,以检查分类系统是否适合浏览。分类算法由判断自动分配的类的正确性的用户评估。研究结果-研究表明Ei分类方案适合浏览。平均而言,自动重新分配的类在某种程度上是正确的,其中某些类的效果优于其他类。研究表明,Successrnof浏览是相关的,并取决于分类的正确性。研究局限/含义-进一步的研究应解决对同一网页的歧义评估问题。 Eirn分类方案中浏览失败的其他原因也需要进一步调查。实用意义-确定了浏览的改进;从一开始就描述类标题和/或列出其子类;允许使用同义词搜索从类标题中搜索单词(由于类被映射到叙词表,因此很容易为Ei提供);在搜索类标题时,返回在其中找到搜索词的类周围展开的层次树。还指出了改进分类方案的必要性。原始性/价值-以前从未在浏览环境下进行过基于用户的自动主题分类评估;因此,该研究还提出了有关方法学的新发现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号