如何在信息量巨大的互联网上准确获取并长期跟踪用户关注的内容,是数据采集和挖掘的重要方面.探讨Web数据采集理论及其应用技术,给出一个半自动采集模型,设计基于旅游业数据的采集系统,验证数据半自动采集的可行性.%It is an important aspect of data extraction and mining that how to exactly gain and chronically trace the content regarded by users on Intemet with huge information. This paper discusses Web data extraction theories and its application technologies, gives a sime-automatic extraction model, and designs a extraction system based on tourism industJy data to prove the feasibility data sime-antomatic extraction.
展开▼