首页>
外国专利>
Systems and methods for generating training documents used by classification algorithms
Systems and methods for generating training documents used by classification algorithms
展开▼
机译:用于生成分类算法使用的培训文档的系统和方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
The disclosed computer-implemented method for generating training documents used by classification algorithms may include (i) identifying a set of training documents used by a classification system to classify documents written in a first language, (ii) generating a list of tokens from within the training documents that indicate critical terms representative of classes defined by the classification system, (iii) translating the list of tokens from the first language to a second language, (iv) creating, based on the translated tokens, a set of simulated training documents that enables the classification system to classify documents written in the second language, and (v) classifying an additional document written in the second language based on the set of simulated training documents. Various other methods, systems, and computer-readable media are also disclosed.
展开▼