Invention Grant
US07991578B2 Method and apparatus for finding cluster in data stream as infinite data set having data objects to be continuously generated
Active
- Patent Title: Method and apparatus for finding cluster in data stream as infinite data set having data objects to be continuously generated
- Patent Title (中): 用于在具有要连续生成数据对象的无限数据集的情况下发现数据流中的簇的方法和装置
- Application No.: US12038649
- Application Date: 2008-02-27
- Publication No.: US07991578B2
- Publication Date: 2011-08-02
- Inventor: Won-Suk Lee
- Applicant: Won-Suk Lee
- Priority: KR10-2007-0110099 (2007-10-31)
- Main IPC: G06F17/18
- IPC: G06F17/18 ; G06F19/00

Abstract:
A method of finding a cluster in a data stream includes updating statistical distribution information of a grid-cell corresponding to a currently generated data element, statistical distribution information on previously generated data elements being managed using grid-cells, which are partitioned within the range of a data space and have statistical distribution information of data elements within the range; comparing the occurrence frequency of the data element in the grid-cell according to the update result with a predefined partitioning threshold, partitioning the grid-cell into a plurality of grid-cells according to the comparison result, and estimating statistical distribution information of the partitioned grid-cells; recursively performing the updating or comparing step until the grid-cell becomes a unit grid-cell; and comparing the occurrence frequency of a data element in the unit grid-cell with a predefined minimum support and defining a set of unit grid-cells as a cluster according to the comparison result.
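The steps in the abstract can be illustrated with a small sketch. The following is not the patented implementation; it is a minimal one-dimensional illustration under several assumptions that do not come from the record: the names `Cell` and `GridClusterer`, the `fanout` parameter, the use of relative frequency (count divided by total elements seen) for both thresholds, and a uniform split of a parent's count as the "estimated" distribution of newly partitioned grid-cells.

```python
from dataclasses import dataclass, field


@dataclass
class Cell:
    lo: float
    hi: float
    count: float = 0.0  # (estimated) number of elements seen in [lo, hi)
    children: list = field(default_factory=list)


class GridClusterer:
    """Hypothetical 1-D sketch of grid-cell clustering over a stream."""

    def __init__(self, lo, hi, unit_width, split_threshold, min_support, fanout=2):
        self.root = Cell(lo, hi)
        self.unit_width = unit_width            # width of a unit grid-cell
        self.split_threshold = split_threshold  # relative frequency that triggers a split
        self.min_support = min_support          # relative frequency of a dense unit cell
        self.fanout = fanout                    # assumed number of children per split
        self.total = 0

    def _is_unit(self, cell):
        return cell.hi - cell.lo <= self.unit_width + 1e-12

    def _split(self, cell):
        # Assumption: spread earlier elements uniformly over the children.
        # The current element is excluded here and counted when we descend.
        w = (cell.hi - cell.lo) / self.fanout
        share = (cell.count - 1) / self.fanout
        for i in range(self.fanout):
            hi = cell.hi if i == self.fanout - 1 else cell.lo + (i + 1) * w
            cell.children.append(Cell(cell.lo + i * w, hi, share))

    def _child_for(self, cell, x):
        w = (cell.hi - cell.lo) / self.fanout
        i = min(int((x - cell.lo) / w), self.fanout - 1)
        return cell.children[i]

    def add(self, x):
        """Update the grid with one newly generated data element."""
        if not self.root.lo <= x < self.root.hi:
            raise ValueError("x is outside the data space")
        self.total += 1
        cell = self.root
        while True:
            cell.count += 1
            if not cell.children:
                # Stop at a unit cell or at a cell that is not yet frequent.
                if self._is_unit(cell) or cell.count / self.total < self.split_threshold:
                    return
                self._split(cell)
            cell = self._child_for(cell, x)

    def leaves(self, cell=None):
        cell = self.root if cell is None else cell
        if not cell.children:
            yield cell
        else:
            for child in cell.children:
                yield from self.leaves(child)

    def clusters(self):
        """Merge adjacent dense unit cells into (lo, hi) intervals."""
        out = []
        for leaf in self.leaves():
            if not (self._is_unit(leaf) and self.total
                    and leaf.count / self.total >= self.min_support):
                continue
            if out and out[-1][1] == leaf.lo:
                out[-1] = (out[-1][0], leaf.hi)
            else:
                out.append((leaf.lo, leaf.hi))
        return out
```

In this sketch each element is processed once and then discarded, so memory depends on the number of grid-cells rather than on the length of the stream. Because the thresholds are relative frequencies, the very first element splits its path down to a unit cell; a practical variant would probably wait for a warm-up period, which is again an assumption rather than something stated in the abstract.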