首页> 外文期刊>Data mining and knowledge discovery >Modeling user interests from web browsing activities
【24h】

Modeling user interests from web browsing activities

机译:从Web浏览活动建模用户兴趣

获取原文
获取原文并翻译 | 示例
           

摘要

Browsing sessions are rich in elements useful to build profiles of user interests, but at the same time HTML pages include noisy data such as advertisements, navigation menus and privacy notes. Moreover, some pages cover several different topics making it difficult to identify the most relevant to the user. For these reasons, they are often ignored by personalized search and recommender systems. We propose a novel approach for recognizing valuable text descriptions of current user information needs-namely cues-based on the data mined from browsing interactions over the web. The approach combines page clustering techniques based on Document Object Model-based representations for acquiring evidence about relevant correlations between text contents. This evidence is exploited for better filtering out irrelevant information and facilitating the construction of interest profiles. A comparative framework proves the accuracy of the extracted cues in the personalize search task, where results are re-ranked according to the last browsed resources.
机译:浏览会话具有丰富的元素,可以构建用户兴趣的简档,但同时HTML页面包括广告,导航菜单和隐私记等噪声数据。此外,一些页面涵盖了几个不同的主题,使得难以识别与用户最相关的主题。由于这些原因,它们通常被个性化搜索和推荐系统忽略。我们提出了一种新的方法,用于识别当前用户信息需要的有价值的文本描述 - 即提示基于从Web浏览交互所开采的数据。该方法基于基于文档对象模型的表示来组合页面群集技术,以获取有关文本内容之间相关相关性的证据。这种证据可以利用,以便更好地过滤出无关信息并促进利息概况的构建。比较框架在个性化搜索任务中证明了提取的提示的准确性,其中结果根据上次浏览资源重新排名。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号