首页>
外国专利>
Finding of titles in the scanned document images, and photo
Finding of titles in the scanned document images, and photo
展开▼
机译:在扫描的文档图像和照片中查找标题
展开▼
页面导航
摘要
著录项
相似文献
摘要
The bitmap image data is analyzed by connected component extraction to identify components or connected components that represent either individual characters or letters, or regions of a nontext image. The connected components are classified as text or nontext based on geometric attributes such as the number of holes, arcs and line ends comprising each component. A nearest-neighbor analysis then identifies which text components represent lines or strings of text and each line or string is further analyzed to determine its vertical or horizontal orientation. Thereafter, separate vertical and horizontal font height filters are used to identify those text strings that are the most likely candidates. For the most likely title candidates a bounding box is defined which can be associated with or overlaid upon the original bitmap data to select the title region for further processing or display. Captions and photographs can also be located.
展开▼