首页> 外文会议>Intelligent Systems Design and Applications, 2005. ISDA '05. Proceedings. 5th International Conference on >Improving the Web text content by extracting significant pages into a Web site
【24h】

Improving the Web text content by extracting significant pages into a Web site

机译:通过将大量页面提取到网站来改善Web文本内容

获取原文

摘要

Web systems have reached a very important role in today's business world. Every day organizations fight to keep their present clients and to gain new ones. In order to accomplish this goal it is very important to make precise changes in the Web site content. However, the development of these improvements is a complex and specialized task because of the nature of the Web data itself. We propose a novel approach to successfully make changes to improve the Web site content using text mining. We use a self organizing feature map (SOFM) to find the most relevant text content, and then we propose a reverse clustering analysis in order to extract the most significant pages of the whole Web site. The effectiveness of this method was experimentally tested in a real Web site.
机译:Web系统已经在当今的商业世界中扮演了非常重要的角色。组织每天都在努力保持现有客户并获得新客户。为了实现此目标,在网站内容中进行精确更改非常重要。但是,由于Web数据本身的性质,开发这些改进是一项复杂而专门的任务。我们提出了一种新颖的方法来成功进行更改以使用文本挖掘来改善网站内容。我们使用自组织特征图(SOFM)查找最相关的文本内容,然后提出反向聚类分析以提取整个网站的最重要页面。该方法的有效性已在真实的网站上进行了实验测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号