Invention Grant
US07644102B2 Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects
有权
方法,系统和制作物品,用于共同发生物体的软分层聚类
- Patent Title: Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects
- Patent Title (中): 方法,系统和制作物品,用于共同发生物体的软分层聚类
-
Application No.: US09982236Application Date: 2001-10-19
-
Publication No.: US07644102B2Publication Date: 2010-01-05
- Inventor: Eric Gaussier , Francine Chen , Ashok Chhabedia Popat
- Applicant: Eric Gaussier , Francine Chen , Ashok Chhabedia Popat
- Applicant Address: US CT Norwalk
- Assignee: Xerox Corporation
- Current Assignee: Xerox Corporation
- Current Assignee Address: US CT Norwalk
- Agency: Fay Sharpe LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
Methods, systems, and articles of manufacture consistent with certain principles related to the present invention enable a computing system to perform hierarchical topical clustering of text data based on statistical modeling of co-occurrences of (document, word) pairs. The computing system may be configured to receive a collection of documents, each document including a plurality of words, and perform a modified deterministic annealing Expectation-Maximization (EM) process on the collection to produce a softly assigned hierarchy of nodes. The process may involve assigning documents and document fragments to multiple nodes in the hierarchy based on words included in the documents, such that a document may be assigned to any ancestor node included in the hierarchy, thus eliminating the hard assignment of documents in the hierarchy.
Public/Granted literature
- US20030101187A1 Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects Public/Granted day:2003-05-29
Information query