首页> 外文期刊>International journal of advanced intelligence paradigms >Information retrieval by mining text and image
【24h】

Information retrieval by mining text and image

机译:通过挖掘文本和图像进行信息检索

获取原文
获取原文并翻译 | 示例
       

摘要

We have wonderful scripts which are lying to be digitised in Tamil. Tamil is a language which is enriched with several ancient scripts. Optical character recognition is done in Tamil in order to digitise the scripts. The optical character recognition consists of scanning phase, preprocessing phase, segmentation phase and recognition phase. The retrieved text is stored as an archive in the database. The archive also encompasses the original images. The front end GUI contains the search engine wherein which the keyword is put. The crawler crawls in the database and retrieves the searched page and the image based on context. The retrieved pages will be displayed in the order of relevant context and the appropriate page is clicked and fetched as desired.
机译:我们有出色的脚本,可以用泰米尔语数字化。泰米尔语是一种富含多种古代文字的语言。光学字符识别是在泰米尔语中完成的,以便将脚本数字化。光学字符识别包括扫描阶段,预处理阶段,分割阶段和识别阶段。检索到的文本将作为档案存储在数据库中。档案还包含原始图像。前端GUI包含其中放置关键字的搜索引擎。搜寻器搜寻数据库并根据上下文检索搜索到的页面和图像。检索到的页面将按相关上下文的顺序显示,并根据需要单击并提取相应的页面。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号