首页> 外文期刊>WSEAS Transactions on Computers >Web Page Analysis Based on HTML DOM and Its Usage for Forum Statistics, Alerts and Geo Targeted Data Retrieval
【24h】

Web Page Analysis Based on HTML DOM and Its Usage for Forum Statistics, Alerts and Geo Targeted Data Retrieval

机译:基于HTML DOM的网页分析及其在论坛统计,警报和按地理位置定位的数据检索中的用途

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Message boards are part of the Internet known as the 'Invisible Web' and pose many problems to traditional search engine spiders. The dynamic content is usually very deep and difficult to search. In addition, many of these sites change their locations, servers, or URLs almost daily creating problems with the indexing process. However, during the growth of the World Wide Web and with the help of search engines, they represent an important source of information to solve different problems. Another interesting feature of this type of web pages is that a big community has been developed, expressing different opinions and discussing various topics. Using special retrieval and indexing algorithms, mostly based on the HTML DOM tree, we have developed an algorithm to obtain detailed and accurate trend statistics that can be used for different marketing solutions and analysis tools. Combined with the services provided by traffic ranking sites like Alexa.com, we can also provide geo targeting functionality to deliver even more accurate results to the end user, such as what percentage of the users who are visiting a certain forum is coming from a certain country.
机译:留言板是被称为“隐形网”的Internet的一部分,给传统的搜索引擎蜘蛛带来了许多问题。动态内容通常很深并且很难搜索。此外,这些站点中的许多站点几乎每天都会更改其位置,服务器或URL,从而在索引过程中造成问题。但是,在万维网的发展过程中,在搜索引擎的帮助下,它们代表了解决不同问题的重要信息来源。此类网页的另一个有趣特征是,已经开发了一个大型社区,可以表达不同的意见并讨论各种主题。使用主要基于HTML DOM树的特殊检索和索引算法,我们开发了一种算法来获取详细而准确的趋势统计信息,该统计信息可用于不同的营销解决方案和分析工具。结合流量排名网站(如Alexa.com)提供的服务,我们还可以提供地理位置定位功能,以向最终用户提供更准确的结果,例如访问某个论坛的用户中有多少来自某个论坛。国家。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号