首页> 外文会议>International Conference on MEMS, NANO and Smart Systems >Research on the Application of Web Mining Technique based on XML for Unstructured Web Data Using LINQ
【24h】

Research on the Application of Web Mining Technique based on XML for Unstructured Web Data Using LINQ

机译:基于XML对非结构化Web数据的网挖技术应用的研究

获取原文

摘要

Web data mining is a field that has gained popularity in the recent time with the advancement in web mining technologies. Web data mining is the extraction of data on web. The term Web Data Mining is a technique used to crawl through various web resources to collect required information, which enables an individual or a company to promote business, understanding marketing dynamics, new promotions floating on the Internet, etc. The data on web is unstructured, irregular and lacks a fixed unified pattern as it is presented in HTML format that represents data in the presentation format and is unable to handle semi-structured or unstructured data . These difficulties lead to the emergence of XML based web data mining. XML was created so that richly structured documents could be used over the web.XML provides a standard for the data exchange and data storage .This paper presents a web data mining model based on XML. In this model first of all unstructured data is transformed to XML and then XML document is stored in database in the form of the string tree, then specific records are searched using a LINQ query. If record does not exist in the database then check the updates of specific website and repeat the same steps. At last data selected by LINQ Query is displayed on web browser. The feature that helped to increase the speed of data extraction and that also reduces the time of extraction is the presence of database that stores the data that have been extracted earlier by a user and can be used by other users by passing a LINQ query .In this model there is no need to create an extra separate XSL file because this model stores xml document in the database in the form of the string tree. This model is implemented using C# with XML.
机译:Web数据挖掘是一个领域,最近在网站挖掘技术的进步时获得了普及。 Web数据挖掘是在Web上提取数据。术语数据挖掘是一种用于通过各种网络资源爬行以收集所需信息的技术,这使得个人或公司能够促进业务,了解营销动态,浮动互联网上的新促销等。网上的数据是非结构化的,不规则且缺少固定的统一模式,因为它以HTML格式呈现,表示表示格式的数据,并且无法处理半结构化或非结构化数据。这些困难导致了基于XML的Web数据挖掘的出现。创建了XML,以便在Web.xml上使用丰富的结构化文档为数据交换和数据存储提供了标准。这篇论文介绍了基于XML的Web数据挖掘模型。在此模型中,首先将所有非结构化数据转换为XML,然后XML文档以字符串树的形式存储在数据库中,则使用LINQ查询搜索特定记录。如果在数据库中不存在记录,则检查特定网站的更新并重复相同的步骤。在Web浏览器上显示LINQ查询选择的最后数据。有助于提高数据提取速度的功能以及还减少了提取的时间是数据库的存在,该数据库存储用户早期提取的数据,并且可以通过传递LINQ查询来使用其他用户。此模型不需要创建额外的单独XSL文件,因为此模型以字符串树的形式存储在数据库中的XML文档。此模型使用C#与XML实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号