Hybrid Approach to Web Content Outlier Mining Without Query Vector

机译：Hybrid方法在没有查询向量的没有查询传染媒介

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Mining outliers from large datasets is like finding needles in a haystack. Even more challenging is sifting through the dynamic, unstructured, and ever-growing web data for outliers. This paper presents HyCOQ, which is a hybrid algorithm that draws from the power of n-gram-based and word-based systems. Experimental results obtained using embedded motifs without a dictionary show significant improvement over using a domain dictionary irrespective of the type of data used (words, n-grams, or hybrid). Also, there is remarkable improvement in recall with hybrid documents compared to using raw words and n-grams without a domain dictionary.

机译：来自大型数据集的挖掘异常值就像在干草堆中找到针头。甚至更具挑战性正在通过用于异常值的动态，非结构化和不断增长的Web数据来实现筛选。本文呈现Hycoq，它是一种混合算法，其从基于N-GRAM的基于单词的系统中汲取的混合算法。使用没有字典的嵌入式图案获得的实验结果显示了使用域字典的显着改进，而不管使用的数据类型（单词，n-gram或混合）。此外，与在没有域字典的未经域字典的原始单词和n-grams相比，召回具有显着的改进。

著录项

来源
《International Conference on Data Warehousing and Knowledge Discovery》|2005年||共10页
会议地点
作者
Malik Agyemang; Ken Barker; Reda Alhajj;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Mathematical Approach for Mining Web Content Outliers using Term Frequency Ranking [J] . S. Sathya Bama, M. S. Irfan Ahmed, A. Saravanan Indian Journal of Science and Technology . 2015,第14期

机译：使用术语频率排序来挖掘Web内容异常值的数学方法
2. Position Score Weighting Technique for Mining Web Content Outliers [J] . W.R. Wan Zulkifeli, N. Mustapha, A. Mustapha International Journal of Applied Mathematics & Statistics . 2013,第6期

机译：位置得分加权技术，用于挖掘Web内容异常值
3. OWA Operator-Based Hybrid Framework for Outlier Reduction in Web Mining [J] . Ankit Gupta, Shruti Kohli International journal of entelligent systems . 2016,第10期

机译：基于OWA运算符的Web挖掘中减少异常值的混合框架
4. Hybrid Approach to Web Content Outlier Mining Without Query Vector [C] . International Conference on Data Warehousing and Knowledge Discovery(DaWaK 2005) . 2005

机译：Hybrid方法在没有查询向量的没有查询传染媒介
5. Web based content and hybrid teaching: Student perceptions of the effectiveness of using web based content and hyper-linked teaching units in teaching hybrid business and marketing post secondary classes. [D] . Richardson, W. Tim G. 2007

机译：基于Web的内容和混合教学：学生对使用基于Web的内容和超链接教学单元在混合商务和市场营销中学后课程教学中的有效性的看法。
6. Effective Filtering of Query Results on Updated User Behavioral Profiles in Web Mining [O] . S. Sadesh, R. C. Suganthe 2015

机译：在Web挖掘中对更新的用户行为配置文件上的查询结果进行有效过滤
7. Mining Translations of Chinese Names from Web Corpora Using a Query Expansion Technique and Support Vector Machine [O] . 2015

机译：利用查询扩展技术和支持向量机从Web语料库挖掘中文名称的翻译

Hybrid Approach to Web Content Outlier Mining Without Query Vector

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅