首页> 外国专利> METHOD, APPARATUS AND COMPUTER-READABLE RECORDING MEDIUM FOR TAGGING IMAGE CONTAINED IN WEB PAGE AND PROVIDING WEB SEARCH SERVICE USING TAGGED RESULT

METHOD, APPARATUS AND COMPUTER-READABLE RECORDING MEDIUM FOR TAGGING IMAGE CONTAINED IN WEB PAGE AND PROVIDING WEB SEARCH SERVICE USING TAGGED RESULT

机译:用于标记网页中包含的图像并使用标记结果提供网络搜索服务的方法,装置和计算机可读记录介质

摘要

PURPOSE: A method for providing a web search service by tagging an image included in a webpage and using a tagging result, and a device and a computer-readable recording medium thereof are provided to enhance usability of the webpage by increasing the number of effectively searchable webpages. CONSTITUTION: A crawling unit periodically crawls webpages(S310). An image recognition/analysis unit of a text reader analyzes an image of the webpage by using an image analysis algorithm(S320). A character/symbol converter converts a character or a symbol included in the analyzed image into a machine-readable character or symbol(S330). A markup language convertor converts result information of a read text into a markup language(S340). An interpreter parses contents of web contents converted in a markup language form(S350). A tagger generates tag data by extracting information needed for tagging from the text included in the parsed web contents. The tag data which is matched with the corresponded web contents and the information for relationship is stored to a tag database(S360).
机译:用途:一种通过标记网页中包含的图像并使用标记结果来提供网页搜索服务的方法,以及提供了一种设备和计算机可读记录介质,以通过增加有效搜索次数来增强网页的可用性网页。构成:爬行单元定期爬行网页(S310)。文本阅读器的图像识别/分析单元通过使用图像分析算法来分析网页的图像(S320)。字符/符号转换器将分析的图像中包括的字符或符号转换为机器可读的字符或符号(S330)。标记语言转换器将读取的文本的结果信息转换为标记语言(S340)。解释器解析以标记语言形式转换的网络内容的内容(S350)。标记器通过从解析的Web内容中包含的文本中提取标记所需的信息来生成标记数据。与对应的网页内容和用于联系的信息匹配的标签数据被存储到标签数据库(S360)。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号