Invention Grant
- Patent Title: Miscategorized outlier detection using unsupervised SLM-GBM approach and structured data
- Application No.: US14861746
- Application Date: 2015-09-22
- Publication No.: US10095770B2
- Publication Date: 2018-10-09
- Inventor: Mingkuan Liu
- Applicant: eBay Inc.
- Applicant Address: US CA San Jose
- Assignee: eBay Inc.
- Current Assignee: eBay Inc.
- Current Assignee Address: US CA San Jose
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/27

Abstract:
In an example, one or more leaf category specific unsupervised statistical language model (SLM) models are trained using sample item listings corresponding to each of one or more leaf categories and structured data about the one or more leaf categories, the training including calculating an expected perplexity and a standard deviation for item listing titles. A perplexity for a title of a particular item listing is calculated and a perplexity deviation signal is generated based on a difference between the perplexity for the title of the particular item listing and the expected perplexity for item listing titles in a leaf category of the particular item listing and based on the standard deviation for item listing titles in the leaf category of the particular item listing. A gradient boosting machine (GBM) fuses the perplexity deviation signal with one or more other signals to generate a miscategorization classification score corresponding to the particular item listing.
Public/Granted literature
- US20170083602A1 MISCATEGORIZED OUTLIER DETECTION USING UNSUPERVISED SLM-GBM APPROACH AND STRUCTURED DATA, Public/Granted day: 2017-03-23