WEBPAGE TEXT EXTRACTION METHOD AND DEVICE, AND WEBPAGE ADVERTISEMENT HANDLING METHOD AND DEVICE
Abstract
Disclosed are a webpage text extraction method and device, and a webpage advertisement handling method and device. The webpage text extraction method comprises: reading webpage data, determining interference data contained in the webpage data, and replacing the interference data with null characters; recording the line number of each line of the webpage and the word count of that line; determining the webpage text from the line numbers and the corresponding word counts; and extracting the webpage text. Compared with the prior art, the method does not depend on a browser environment or on page structure, and is readily extensible.
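The steps in the abstract can be illustrated with a minimal Python sketch. It is one possible reading, not the patented implementation. The abstract does not say what counts as interference data or how the text is determined from the line numbers and word counts, so the following are assumptions: interference data is taken to be scripts, styles, comments and remaining tags; the text is taken to be the run of word-dense lines with the largest total word count. All function names and the `min_words` and `max_gap` parameters are hypothetical.

```python
import re

# Assumed "interference data": scripts, styles, comments, then any remaining tags.
_INTERFERENCE = [
    re.compile(r"<script\b.*?</script\s*>", re.S | re.I),
    re.compile(r"<style\b.*?</style\s*>", re.S | re.I),
    re.compile(r"<!--.*?-->", re.S),
    re.compile(r"<[^>]+>", re.S),
]


def _blank(match):
    # Drop the matched span but keep its newlines so line numbers stay stable.
    return "\n" * match.group(0).count("\n")


def strip_interference(html):
    """Replace every interference span with nothing (newlines preserved)."""
    for pattern in _INTERFERENCE:
        html = pattern.sub(_blank, html)
    return html


def line_word_counts(text):
    """Return (line_number, word_count) pairs, with 1-based line numbers."""
    return [(n, len(line.split()))
            for n, line in enumerate(text.split("\n"), start=1)]


def extract_text(html, min_words=5, max_gap=1):
    """Return the run of dense lines that has the largest total word count.

    A line is "dense" if it has at least min_words words (assumed threshold).
    A run tolerates up to max_gap consecutive sparse lines (assumed tolerance).
    """
    cleaned = strip_interference(html)
    lines = cleaned.split("\n")
    best_total, best_first, best_last = 0, None, None
    first, total, gap = None, 0, 0
    for number, words in line_word_counts(cleaned):
        if words >= min_words:
            if first is None:
                first, total = number, 0
            total += words
            gap = 0
            if total > best_total:
                best_total, best_first, best_last = total, first, number
        elif first is not None:
            gap += 1
            if gap > max_gap:
                first = None  # the current run has ended
    if best_first is None:
        return ""
    body = lines[best_first - 1:best_last]
    return "\n".join(line.strip() for line in body if line.strip())
```

Calling `extract_text(page_source)` on a raw HTML string returns the extracted lines joined by newlines, or an empty string if no line reaches `min_words`. The sketch uses only regular expressions and line statistics, so it needs neither a browser nor a DOM parse. Counting words by splitting on whitespace is itself an assumption that suits space-delimited languages; for Chinese text a character count would be the natural substitute.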