Invention Grant
- Patent Title: Real-time identification of data candidates for classification based compression
- Patent Title (中): 实时识别基于分类的压缩数据候选
-
Application No.: US14746588Application Date: 2015-06-22
-
Publication No.: US09588980B2Publication Date: 2017-03-07
- Inventor: Jonathan Amit , Lilia Demidov , George Goldberg , Nir Halowani , Ronen I. Kat , Chaim Koifman , Sergey Marenkov , Dmitry Sotnikov
- Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Griffiths & Seaton PLLC
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F17/30

Abstract:
Identification of data candidates for data processing is performed in real time by a processor device in a distributed computing environment. Data candidates are sampled for performing a classification-based compression upon the data candidates. A heuristic is computed on a randomly selected data sample from the data candidate, the heuristic computed by, for each one of the data classes, calculating an expected number of characters to be in a data class, calculating an expected number of characters that will not belong to a predefined set of the data classes, and calculating an actual number of the characters for each of the data classes and the non-classifiable data.
Public/Granted literature
- US20150317381A1 REAL-TIME IDENTIFICATION OF DATA CANDIDATES FOR CLASSIFICATION BASED COMPRESSION Public/Granted day:2015-11-05
Information query