首页>
外国专利>
Miscategorized outlier detection using unsupervised SLM-GBM approach and structured data
Miscategorized outlier detection using unsupervised SLM-GBM approach and structured data
展开▼
机译:使用无监督的SLM-GBM方法和结构数据进行错误分类的异常检测
展开▼
页面导航
摘要
著录项
相似文献
摘要
In an example, one or more leaf category specific unsupervised statistical language model (SLM) models are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
展开▼