Document data classification using a noise-to-content ratio
Abstract
A method and system for classifying document data are described. The method may include classifying a first portion of an electronic document as substantive content or noise, classifying a second portion of the electronic document as substantive content or noise, determining a first feature of the first portion of the electronic document indicative of substantive content using a machine learning algorithm, and determining a second feature of the second portion of the electronic document indicative of noise using the machine learning algorithm.
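As a rough illustration of the idea, the sketch below labels each portion of a document as substantive content or noise and then computes a noise-to-content ratio. Everything in it is an assumption made for illustration: the abstract does not define the ratio, so it is taken here as noise characters divided by content characters; the keyword rule in `classify_portion` is a hand-written stand-in for the machine learning algorithm, not the patented method; and the names `Portion`, `NOISE_MARKERS`, `classify_document`, and `noise_to_content_ratio` are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical boilerplate markers. A real system would learn such
# features from labeled data instead of hard-coding them.
NOISE_MARKERS = ("unsubscribe", "confidential", "sent from my", "all rights reserved")


@dataclass
class Portion:
    text: str
    label: str  # "content" or "noise"


def classify_portion(text: str) -> str:
    """Stand-in for a trained classifier: flag known boilerplate phrases as noise."""
    lowered = text.lower()
    if any(marker in lowered for marker in NOISE_MARKERS):
        return "noise"
    return "content"


def classify_document(document: str) -> list[Portion]:
    """Split the document on blank lines and label each non-empty portion."""
    chunks = [chunk.strip() for chunk in document.split("\n\n") if chunk.strip()]
    return [Portion(chunk, classify_portion(chunk)) for chunk in chunks]


def noise_to_content_ratio(portions: list[Portion]) -> float:
    """Assumed definition: characters labeled noise / characters labeled content."""
    noise = sum(len(p.text) for p in portions if p.label == "noise")
    content = sum(len(p.text) for p in portions if p.label == "content")
    if content == 0:
        # No content at all: infinite ratio if any noise exists, else zero.
        return float("inf") if noise else 0.0
    return noise / content
```

Under these assumptions, a document could be passed to `classify_document` and the resulting portions to `noise_to_content_ratio`; a higher value would suggest the document is dominated by boilerplate such as signatures or legal footers.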