DATA EXTRACTION ENGINE FOR STRUCTURED, SEMI-STRUCTURED AND UNSTRUCTURED DATA WITH AUTOMATED LABELING AND CLASSIFICATION OF DATA PATTERNS OR DATA ELEMENTS THEREIN, AND CORRESPONDING METHOD THEREOF
A fully or semi-automated, integrated learning, labeling and classification system and method provide a closed, self-sustaining pattern recognition, labeling and classification operation. Unclassified data sets are selected and converted into an assembly of graphic and text data, forming compound data sets that are to be classified. A machine learning classifier is trained by means of feature vectors, which can be generated automatically, in order to improve the classification operation of the automated system. The classification operation during training is taken as a measure of the classification performance to be expected when the automated labeling and classification system is applied to unlabeled and unclassified data sets. Unclassified data sets are then classified automatically by applying the machine learning classifier of the system to the compound data sets of those unclassified data sets.
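The train-then-classify flow described in the abstract can be illustrated with a minimal sketch. The abstract does not name a feature representation or a classifier, so everything here is an assumption made for illustration: the feature vectors are plain word counts over text only (the graphic part of a compound data set is not modeled), the classifier is a nearest-centroid rule, and the function names, labels and sample texts are hypothetical.

```python
import math
from collections import Counter


def feature_vector(text, vocabulary):
    """Build a word-count vector for the text (assumed feature scheme)."""
    counts = Counter(text.lower().split())
    return [counts[term] for term in vocabulary]


def train_centroids(labeled_texts, vocabulary):
    """Average the feature vectors per label (nearest-centroid training)."""
    sums = {}
    totals = Counter()
    for text, label in labeled_texts:
        vec = feature_vector(text, vocabulary)
        acc = sums.setdefault(label, [0.0] * len(vocabulary))
        for i, value in enumerate(vec):
            acc[i] += value
        totals[label] += 1
    return {
        label: [value / totals[label] for value in acc]
        for label, acc in sums.items()
    }


def classify(text, centroids, vocabulary):
    """Return the label whose centroid is closest in Euclidean distance."""
    vec = feature_vector(text, vocabulary)
    return min(centroids, key=lambda label: math.dist(vec, centroids[label]))


# Hypothetical labeled training data; not taken from the patent.
labeled = [
    ("invoice total amount due", "invoice"),
    ("invoice payment amount", "invoice"),
    ("claim accident damage report", "claim"),
    ("claim damage vehicle", "claim"),
]
vocabulary = sorted({word for text, _ in labeled for word in text.split()})
centroids = train_centroids(labeled, vocabulary)

# Classify previously unlabeled texts with the trained classifier.
for unlabeled in ("amount due on invoice", "vehicle damage claim"):
    print(unlabeled, "->", classify(unlabeled, centroids, vocabulary))
```

In a closed, self-sustaining loop of the kind the abstract describes, newly classified data sets could in principle be fed back as additional training data. That feedback step is not shown here.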