Source: Computing and Informatics

Text Categorization and Sorting of Web Search Results


Abstract

As the Internet faces a growing problem of information overload, the large volume, weak structure, and noisiness of Web data make it amenable to the application of machine learning techniques. After an overview of several topics in text categorization, including document representation, feature selection, and choice of classifier, the paper presents experimental results on the performance and effects of different transformations of the bag-of-words document representation, and of feature selection, on texts extracted from the dmoz Open Directory of Web pages. Finally, the paper describes the primary motivation for the experiments: CatS, a new meta-search engine that uses text categorization to enhance the presentation of search results obtained from a major Web search engine.
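To make the terms in the abstract concrete, here is a minimal sketch of a bag-of-words representation, one possible transformation of it, and one simple form of feature selection. The abstract does not say which transformations or selection criteria the paper evaluates, so the choices here (tf-idf weighting, unit-length normalization, a document-frequency threshold) and all function names are illustrative assumptions, not the authors' method.

```python
import math
from collections import Counter


def bag_of_words(docs):
    """Count term occurrences per document (naive whitespace tokenizer)."""
    return [Counter(doc.lower().split()) for doc in docs]


def document_frequency(bags):
    """Number of documents in which each term appears at least once."""
    return Counter(term for bag in bags for term in bag)


def tfidf_normalized(bags):
    """Assumed transformation: tf * log(N / df), scaled to unit length."""
    n_docs = len(bags)
    df = document_frequency(bags)
    vectors = []
    for bag in bags:
        vec = {t: tf * math.log(n_docs / df[t]) for t, tf in bag.items()}
        norm = math.sqrt(sum(w * w for w in vec.values()))
        vectors.append({t: (w / norm if norm else 0.0) for t, w in vec.items()})
    return vectors


def select_by_document_frequency(bags, min_df=2):
    """Assumed feature selection: keep terms found in at least min_df documents."""
    df = document_frequency(bags)
    return {t for t, count in df.items() if count >= min_df}


if __name__ == "__main__":
    docs = ["web search results", "web page text", "text categorization"]
    bags = bag_of_words(docs)
    print(tfidf_normalized(bags)[0])
    print(select_by_document_frequency(bags))
```

In this sketch, a term that occurs in every document receives weight zero under tf-idf, and the frequency threshold discards terms too rare to generalize. A real pipeline would also need proper tokenization and possibly stemming, which are omitted here.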
