Invention Grant
- Patent Title: Large data set negative information storage model
-
Application No.: US15751955Application Date: 2016-08-12
-
Publication No.: US11216442B2Publication Date: 2022-01-04
- Inventor: Jamie K. Teer , Ruizheng Liu , Guillermo Gonzalez-Calderon , Rodrigo Carvajal-Pelaez
- Applicant: H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE, INC.
- Applicant Address: US FL Tampa
- Assignee: H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE, INC.
- Current Assignee: H. LEE MOFFITT CANCER CENTER & RESEARCH INSTITUTE, INC.
- Current Assignee Address: US FL Tampa
- Agency: Meunier Carlin & Curfman LLC
- International Application: PCT/IB2016/054868 WO 20160812
- International Announcement: WO2017/025935 WO 20170216
- Main IPC: G06F16/215
- IPC: G06F16/215 ; G06F16/23 ; G06F16/22 ; G16B50/00 ; G16B30/00 ; G16B50/50 ; G16B30/10

Abstract:
Systems and methods for storing large data sets, such as genetic sequence information. Within a “targeted subset” of positions with information, the system stores, both variant states and missing states at each position. Reference states are not stored, but are inferred within the targeted subset when neither a variant nor a missing state is stored at a given position. The absence of a variant state at a given position is assumed to be a reference state. The criteria for missing data are defined in pre-processing and are customizable based on the use case. For example, each data point may represent the genetic information of a sample at a position in the genome. The targeted subset may represent those positions that were included in a sequencing test.
Public/Granted literature
- US20180239797A1 LARGE DATA SET NEGATIVE INFORMATION STORAGE MODEL Public/Granted day:2018-08-23
Information query