correlation methods; data compression; document image processing; entropy; feature extraction; image classification; image segmentation; text analysis; automatic correlation-entropy feature extraction; compressed document image analysis; compressed text line characterization; compressed text line segmentation; computing resources; correlation-entropy features; digital libraries; document analysis system; document image analysis system; e-governance applications; feature extraction techniques; nontext component detection; real time systems; run-length compressed TIFF documents; run-length compressed domain; text document analysis; uncompressed images; Image coding; Image segmentation; Compressed document feature extraction; Compressed document image processing; Correlation-entropy analysis; Run-length compressed domain;
机译:可视化CCITT第3组和第4组TIFF文档,并转换为可在压缩域中直接处理的运行时压缩格式
机译:直接在压缩域中的文档图像分析技术综述
机译:深度文本挖掘,用于从文本文档中自动提取关键词
机译:直接在运行长度压缩域中自动提取文本文档分析的相关熵特征
机译:文档中的名词短语:对不同类别的文本进行预处理,自动提取和统计分析。
机译:计算N-gram的对称强度:文本文档自动分类中的两遍过滤方法
机译:提取投影轮廓,运行直方图和熵特征 直接从运行长度压缩文本文档
机译:从技术文本中提取几乎自动语义特征。