Invention Grant
- Patent Title: Dividing a dataset into sub-datasets having a subset of values of an attribute of the dataset
-
Application No.: US16005889Application Date: 2018-06-12
-
Publication No.: US10552378B2Publication Date: 2020-02-04
- Inventor: Thomas F. Boehme , Andreas Brodt , Namik Hrle , Oliver Schiller
- Applicant: International Business Machines Corporation
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agent Michael O'Keefe
- Main IPC: G06F11/14
- IPC: G06F11/14 ; G06F11/00 ; G06F3/06 ; G06F16/16 ; G06F7/36 ; G06F16/22 ; G06F16/27 ; G06F16/245 ; G06F16/28

Abstract:
Sorting and storing a dataset, the dataset comprising at least one attribute. The method includes defining a set of data blocks and assigning to each data block a predefined maximum number of entries or a predefined maximum amount of storage, dividing the dataset into a sequence of multiple sub-datasets each having one value or a range of values of the attribute, wherein each pair of successive sub-datasets of the sequence are non-overlapping or overlapping at their respective extremum value of the attribute, for each sub-dataset of the multiple sub-datasets: in case the sub-dataset fully or partially fits into a data block of the defined data blocks storing the sub-dataset into at least the data block, the sub-dataset that partially fits into the data block comprising a number of entries that is smaller than a predefined maximum threshold.
Public/Granted literature
- US20180293251A1 METHOD FOR STORING A DATASET Public/Granted day:2018-10-11
Information query