首页>
外国专利>
Self-learning based crawling and rule-based data mining for automatic information extraction
Self-learning based crawling and rule-based data mining for automatic information extraction
展开▼
机译:基于自学习的爬网和基于规则的数据挖掘,可自动提取信息
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and Systems for automatic information extraction by performing self-learning crawling and rule-based data mining is provided. The method determines existence of crawl policy within input information and performs at least one of front-end crawling, assisted crawling and recursive crawling. Downloaded data set is pre-processed to remove noisy data and subjected to classification rules and decision tree based data mining to extract meaningful information. Performing crawling techniques leads to smaller relevant datasets pertaining to a specific domain from multi-dimensional datasets available in online and offline sources.
展开▼