Invention Grant
- Patent Title: System and method for clustering content according to similarity
- Patent Title (中): 根据相似性对内容进行聚类的系统和方法
-
Application No.: US14013902Application Date: 2013-08-29
-
Publication No.: US09026518B2Publication Date: 2015-05-05
- Inventor: Ned Rhinelander , Clifford Lyon
- Applicant: CBS Interactive Inc.
- Applicant Address: US CA San Francisco
- Assignee: CBS Interactive Inc.
- Current Assignee: CBS Interactive Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Reed Smith LLP
- Agent Marc S. Kaufman
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Systems and methods for clustering content according to similarity are provided that identify and group similar content using a set of tags associated with the content. A topic model of a group of content is built, producing a probability distribution of topic membership for the content. Individual items of content are then clustered using a clustering algorithm, and a distance matrix from the probability distribution is built. Based on the distance matrix, individual items of content are labeled as “must-link” or “cannot-link” pairs with the group of content. The topic model is then embedded into successively smaller dimensions using a kernel method, until the clustering is stable with respect to both the behavioral and content domains.
Public/Granted literature
- US20140250127A1 SYSTEM AND METHOD FOR CLUSTERING CONTENT ACCORDING TO SIMILARITY Public/Granted day:2014-09-04
Information query