Innovative Text Extraction Algorithm Based on TensorFlow

机译：基于Tensorflow的创新文本提取算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Extracting business registration information exploiting graphic recognition algorithms on the Internet nowadays is vital to e-commercial business. However, business registration information is usually presented in graphics and existing graphic recognition systems have been hindered because of their slow detection speed, low accuracy, and complex operations. Thereby, we propose an innovative text extraction algorithm based on TensorFlow (TEAT). We first utilize the web crawler to obtain the data source, and then extract the character information by using our TEAT based on TensorFlow framework recognition technology. Our TEAT algorithm can extract business registration information efficiently and effectively. Comparing with existing text extraction algorithm based on Tess4j framework for extracting Tmall shop business license picture information, our TEAT has obvious advantages over Tess4j framework with higher accuracy and efficiency.

机译：提取商业登记信息现在在互联网上利用图形识别算法对电子商务业务至关重要。但是，商业注册信息通常以图形呈现，并且由于其检测速度，低精度和复杂操作慢，因此已经阻碍了现有的图形识别系统。由此，我们提出了一种基于Tensorflow（乳头）的创新文本提取算法。我们首先利用Web爬网程序获取数据源，然后通过基于Tensorflow框架识别技术使用我们的次来提取字符信息。我们的乳头算法可以有效且有效地提取商业登记信息。与基于TESS4J框架的现有文本提取算法进行比较，用于提取TMALL商店业务许可证信息，我们的乳头在TESS4J框架中具有明显的优点，具有更高的准确性和效率。

著录项

来源
《International Conference on Genetic and Evolutionary Computing》|2019年|xvii 745 p. :|共8页
会议地点
作者
Shichen Zhai; Xiaogang Wang; Di Xiao; Zhiwen Li;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-532;
关键词
OpenCV; Convolutional neural network; Network crawler; Multithread distributed;

机译：OpenCV;卷积神经网络;网络履带;多线程分布;

相似文献

外文文献
中文文献
专利

1. Deep Learning-based Extraction of Algorithmic Metadata in Full-Text Scholarly Documents [J] . Iqra Safder, Saeed-Ul Hassan, Anna Visvizi, Information Processing & Management . 2020,第6期

机译：全文学术文档中算法元数据的深度学习提取
2. TEXT MINING ALGORITHM DISCOTEX (DIS-COVERY FROM TEXT EXTRACTION) WITH INFORMATION EXTRACTION [J] . Dr.T..LALITHA, S.MEENAKSHI Journal of Theoretical and Applied Information Technology . 2014,第2期

机译：具有信息提取功能的文本挖掘算法DISCOTEX（来自文本提取的发现）
3. TEXT MINING ALGORITHM DISCOTEX (DIS-COVERY FROM TEXT EXTRACTION) WITH INFORMATION EXTRACTION [J] . Dr.T..LALITHA, S.MEENAKSHI Journal of Theoretical and Applied Information Technology . 2014,第2期

机译：具有信息提取功能的文本挖掘算法DISCOTEX（来自文本提取的发现）
4. Innovative Text Extraction Algorithm Based on TensorFlow [C] . Shichen Zhai, Xiaogang Wang, Di Xiao, International Conference on Genetic and Evolutionary Computing . 2019

机译：基于Tensorflow的创新文本提取算法
5. Graph-based Algorithms for Keyphrase Extraction in Social Text. [D] . Al-Dhelaan, Mohammed. 2014

机译：基于图的社交文本中关键词提取算法。
6. Validation of the Total Visual Acuity Extraction Algorithm (TOVA) for Automated Extraction of Visual Acuity Data From Free Text Unstructured Clinical Records [O] . Douglas M. Baughman, Grace L. Su, Irena Tsui, -1

机译：从自由文本非结构化临床记录中自动提取视敏度数据的总视敏度提取算法（TOVA）的验证
7. Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization [O] . 2008

机译：基于图的句子提取排序算法在文本摘要中的应用

Innovative Text Extraction Algorithm Based on TensorFlow

摘要

著录项

相似文献

相关主题

期刊订阅