首页>
外国专利>
EXTRACTING CONTENT FROM AS DOCUMENT USING VISUAL INFORMATION
EXTRACTING CONTENT FROM AS DOCUMENT USING VISUAL INFORMATION
展开▼
机译:使用视觉信息从作为文档中提取内容
展开▼
页面导航
摘要
著录项
相似文献
摘要
An aspect of the present invention discloses a method for extracting content from a document. The method includes one or more processors identifying a visual anchor corresponding to a text element depicted in a first document utilizing an edge detection analysis. The method further includes determining edge coordinates of the text element depicted in the first document. The method further includes determining text at a leading edge of the text element depicted in the first document and text at a trailing edge of the text element depicted in the first document, based on the determined edge coordinates. The method further includes extracting a complete version of the text element depicted in the first document, from a plain text version of the first document, utilizing the determined text at the leading edge of the text element and the determined text at the trailing edge of the text element.
展开▼