首页>
外国专利>
System and method for smoothing hierarchical data using isotonic regression
System and method for smoothing hierarchical data using isotonic regression
展开▼
机译:使用等渗回归平滑分层数据的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
展开▼